A colony of the protozoan
species Choanoeca flexa. As a
choanoflagellate, C. flexa is one
of the closest living 
relatives of animals. 
This species forms 
cup-shaped colonies 
that quickly invert 
their curvature in response to light-to- 
dark transitions by undergoing collective 
cell contractions. This behavior may offer 
insight into the evolutionary origin of 
tissue bending in animals. See pages 300 
and 326. Image: Thibaut Brunet (University 
of California Berkeley/Howard Hughes 
Medical Institute) 
EDITORIAL


Scientists and politics? 


The lower legislative chamber of the United
Kingdom’s Parliament has 650 members, but 
only one (0.15%) has a science Ph.D. This 
seems like a surprisingly small number in a 
mature democracy. About 0.8% of people in 
the United Kingdom have a science Ph.D., so it 
appears that science is seriously underrepre- 
sented. I suspect it is the same the world over. Why 
is this, is it right, and what are the consequences? 
There is, fortunately, an increasing focus on making 
governments representative of the diversity of the pop- 
ulation they serve in terms of gender, race, and sexual 
orientation. But diversity also 
needs to embrace different 
intellectual approaches. The 
structured thinking and dis- 
ciplined methodologies of sci- 
ence add to diversity, but these 
are aspects that can challenge 
vested interests. The blunt, 
socially insensitive, scientist 
speaking truth to power is 


certainly a caricature, but it 
is sufficiently real to warrant 
careful management by gov- 
ernments. There is also often 
suspicion that scientists oper- 
ate their own agendas. 

Vested interests do not 
want their political, social, 
and financial currency de- 
based by being confronted by 
the real world. Governments 
have, therefore, tended to put 
scientists in a metaphorical 
box and to only open the lid 
when they are needed, thus reducing science to a tech- 
nical service function, supplying support, advice, and 
economic goods. These boxes can take many forms, from 
the containment of advisers with a set of rules to forums 
created for science to play within, like the Intergovern- 
mental Panel on Climate Change (IPCC). Until recently, 
I was chief scientific adviser in the U.K. government on 
food and environment, a role that only slightly corrected 
for poor integration of science within government. 

It is this systematic exclusion that leads to the under- 
representation of scientists in politics and government. 
Scientific culture is dominated by human social culture 
and is forced to play by its rules. As C. P. Snow said 
60 years ago, “This polarisation is sheer loss to us all.” 
Politics would look very different if it were more evenly
balanced between these cultures. Aristotle saw politics
as legislative science or learning by experience. Politics 
was itself a scientific inquiry, thus reversing the cur- 
rent maxim that science lies outside politics. In Aris- 
totle’s world, scientists would be those who designed 
policies to solve tricky problems. 

The modern idiom of scientists as the custodians 
and discoverers of knowledge is much too restrictive. 
When contested issues arise, society needs scientists to 
be dominant voices acting as authoritative, impartial, 
and trusted arbiters who can explain where uncertain- 
ties lie in evidence and the likely consequences of al- 
ternative futures. 

Unfortunately, with the 
possible exception of climate 
change (helped by the IPCC), 
scientists are left to struggle 
to form a politically influen- 
tial caucus working within its 
own rules. The alternative is 
capture by the language and 
ways of the other culture. 
Under this scenario, science 
becomes yet another money- 
grabbing vested interest. As 
a result, many of those who 
would most benefit from lis- 
tening put scientists in an 
opposing political box and 
close the lid. Advocacy is the 
surest and most rapid way to 
achieve such an effect. It can 
compromise the impartiality 
of scientific evidence and do 
more harm than good. When 
science becomes captured by 
normal politics, its value drains away. 

Unless the scientific community grips this problem 
of systemic disempowerment within government and 
politics, science will continue to be manipulated within 
a political game dominated by vested interests. The un- 
derrepresentation of science seems likely to continue 
unless the process of government itself becomes more 
diverse. Scientists themselves can help by demonstrat- 
ing greater thought leadership, showing more enthu- 
siasm to become part of government in all its forms, 
and valuing the contributions of colleagues who get 
involved. They need to break out of their synthetic box 
and politely refuse to have the lid closed on them. 


-Ian L. Boyd
“It'll have to be bigger, brighter and pay better.”


Statistician Adrian Smith, author of a forthcoming report 
recommending the United Kingdom create an alternative to the European Research 
Council if U.K. scientists cannot receive its funding post-Brexit. 
The InSight lander’s robot arm, shown hovering over its heat probe, is helping it penetrate the surface.


PLANETARY SCIENCE 


Mars lander resumes hammering probe 


NASA last week reported progress in getting the stuck heat probe
on its InSight lander to resume hammering itself below the martian
surface. During the previous 9 months, the probe, designed
to measure heat flow from Mars’s interior, had penetrated only
35 centimeters instead of the planned 5 meters. The lumpy soil
surrounding the hole does not cave into it as the probe tip hammers,
preventing the probe from gaining the traction necessary for penetration.
After rescue attempts failed this summer, NASA and scientists
from the German Aerospace Center (DLR), which designed the probe, 
finally decided to use InSight’s robotic arm to pin the heat probe, called 
a mole, against the side of its hole, increasing friction. Hammering re- 
sumed last week, and the mole had already penetrated 3 centimeters 
more when Science went to press, DLR said. 
Econ experimenters win Nobel


PRIZES | This year’s Nobel Memorial Prize 
in Economic Sciences honors three pio- 
neers of the use of randomized controlled 
trials to determine how best to ameliorate 
global poverty. Michael Kremer of Harvard 
University and Abhijit Banerjee and Esther 
Duflo of the Massachusetts Institute of 
Technology in Cambridge have often worked 
together to test antipoverty interventions 

in education, health, agriculture, and access 
to credit, mainly in sub-Saharan Africa and 
India. The experimental approach in eco- 
nomics is labor intensive and faces questions 
about whether effects observed in such trials 
hold over the long run. Duflo, 46, is just the 
second woman to win the economics Nobel 
and the only woman among the 14 Nobel 
laureates this year. 


Chemical attack paper vetoed 


WARFARE | In an about-face, the journal 
Science & Global Security decided last week 
not to publish a controversial article that 
casts doubt on the Syrian government’s 
responsibility for a 2017 chemical attack 
that killed more than 80 people. Scientists 
had accused one of the authors, professor 
emeritus Ted Postol of the Massachusetts 
Institute of Technology in Cambridge, of 
pushing conspiracy theories. The journal’s 
editors wrote on its website that they cannot 
“rectify” problems with the manuscript or 
provide it “an independent, fair, effective, 
and conclusive blind peer review.” Postol 
called the decision “totally wrong” and said 
he would resign from the journal’s editorial 
board. (He says he has not been involved in 
the journal’s deliberations about the paper.) 
An email from the editors to the authors, 
which Postol shared with Science, says the 
paper was sent to two reviewers who were 
divided, but does not provide details. 


Theft of biblical texts alleged 


ANTIQUITIES | Dirk Obbink, a professor 
at the University of Oxford in the United 
Kingdom, was accused this week of steal- 
ing 11 ancient Egyptian papyrus fragments 
from a scholarly collection he directed 
and selling them to Hobby Lobby Stores, 
the U.S. craft chain run by the evangelical 
Alfred Maudslay
staircase's glyphs and made other records of
Palenque and other Maya cities. Rain and sun
have since taken a toll on the staircase. Over
the past 3 years, the British Museum in London
worked with Google Arts & Culture to digitally
scan Maudslay’s archive, which the museum
holds. The scans (shown in a digital model,
right) were then used to mill exact replicas of
the glyphs from limestone, the same material
used for the original.


Green family. The fragments, all inscribed 
with biblical texts, ended up at the 
Museum of the Bible in Washington, D.C., 
which the Greens built in part to display 
their collection of such artifacts. That 
museum and the Egypt Exploration Society 
(EES), a London nonprofit that originally 
owned the fragments, found that 13 frag- 
ments had mysteriously gone missing from 
EES; 11 can be traced to sales by Obbink to 
Hobby Lobby, EES reported on 14 October. 
The museum says it will return all 13 frag- 
ments to EES. Efforts to reach Obbink for 
comment were unsuccessful. 


Vision loss goes unaddressed 


PUBLIC HEALTH | At least 1 billion people 
live with a moderate to severe vision 
impairment, such as glaucoma or far- 
sightedness, that is uncorrected or could 
have been prevented, says a report last week 
by the World Health Organization (WHO). 
The burden is disproportionately high in 


Avoidable impairment (millions of people)

Age-related farsightedness: 826
Refractive errors, e.g., myopia: 124
Cataract: 65
Glaucoma: 7
Corneal opacities: 4
Diabetic retinopathy: 3
Trachoma: 2
Uncorrected myopia
causes an estimated
$244 billion in annual global
productivity losses;
uncorrected age-related
farsightedness, $25 billion.


women and other specific populations, 
including people in rural areas and low- and 
middle-income countries; poor residents 

of sub-Saharan Africa and South Asia have 
rates of blindness eight times higher than 
people in high-income countries, WHO says. 
The number of unaddressed vision problems 
will likely increase significantly in coming 
years as the world’s population ages and 
more people move to cities, the report says; 
urban living is associated with increased 
rates of nearsightedness. More than 

$20 billion would be needed to correct or 
prevent the unaddressed vision impair- 
ments. The report recommends including 
vision care in primary health care settings 
and national health plans. 


India tightens China exchanges 


INTERNATIONAL AFFAIRS | India moved 
last week to scrutinize its collaborative 
research agreements with institutions 

in China. Under a new policy from 

the University Grants 
Commission, which 
regulates more than 900 
institutions of higher edu- 
cation, India’s ministries 
of Home Affairs and 
External Affairs need to 
approve such deals in the 
future. Scientists fear the 
policy, triggered by tensions 
over borders and growing 
mistrust between the coun- 
tries, will further squeeze 
what is already a trickle of such exchanges:
From 1995 to 2018, only 88 Indian scientists 
visited China and 73 Chinese researchers 
came to India under a program run by the 
countries’ national science academies. 

(A small number of additional exchanges 
have occurred under other auspices.) 


Congo measles cases lead globe 


INFECTIOUS DISEASE | The Democratic 
Republic of the Congo (DRC), already 
struggling with an outbreak of the Ebola 
virus, is also contending with the world’s 
largest current surge of measles, with more 
than 203,000 cases so far this year, triple 
its 2018 total, the United Nations Children’s 
Fund (UNICEF) announced on 9 October. 
Children under age 5 are bearing the brunt 
of the epidemic, accounting for 74% of infec- 
tions and nearly 90% of the country’s 
4096 measles deaths this year. (By com- 
parison, 2143 people have died from Ebola 
since the current outbreak emerged in the 
summer of 2018.) In 2018, measles vaccina- 
tion coverage in the DRC was just 57%; 
about 95% is required to prevent measles 
outbreaks. Weak health care infrastructure 
and community mistrust of vaccines and 
vaccinators have contributed to the poor 
coverage, UNICEF said. 


U.S. STDs break record 


PUBLIC HEALTH | Combined total U.S. 
cases of the three most commonly reported 
sexually transmitted diseases (STDs) 


18 OCTOBER 2019 • VOL 366 ISSUE 6463 287


NEWS | IN BRIEF 




A bias in Nobels to women


This year, only one woman won a Nobel 
Prize in a scientific field, economics. 
Since the awards were first given in 1901, 
women have taken home just 22 Nobels, 
about 3% of the total. Is that because 
women were simply underrepresented in 
research fields years or decades before 
the prizes were given? According 

to physicist Liselotte Jauffred of the 
University of Copenhagen, the answer 

is no. With two colleagues, she published 
an analysis in May in Palgrave Commu- 
nications showing that bias, not under- 
representation, almost certainly accounts 
for the scarcity of women Nobel winners. 


Q: How did you do your analysis? 

A: We assumed that among all tenured 
faculty members, everybody would have 
the same chance to enter this category 
of really esteemed researchers. We 
sampled a pool of [U.S. National Science 
Foundation data on] faculty members and 
said that if there is a 10% chance of the 
faculty members that are women, then 
we would think that the success rate of 
women to get a Nobel Prize would also be 
10%. Then we looked at the success rate 
of women in getting the Nobel Prize and 
see that it is much, much lower than what 
the gender ratio suggests. 


Q: Can your study say anything about 
the sources of bias? 

A: Our study is purely a statistical study 
that says that there is a bias. [But] I don’t
actually think it’s so much the Nobel
Committee that is implementing this bias.
I think it’s something that happens in
multiple earlier steps. 


Q: The Royal Swedish Academy of 
Sciences said it has increased the num- 
ber of female nominees for the prizes. 
Are there signs of progress? 

A: We cannot say anything about trends 
based on a single year. We did make
a very small Monte Carlo simulation,
though. Every year, the chance that the
winners will be all men in chemistry,
economics, and physics is over
80%, and medicine is around 60%. ...
In the last 20 years, there have been
about the same number of women 
Nobel laureates as in the first 100 years. 
It’s progressing in the right direction, 
but not fast enough. 
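
As a rough illustration of the reasoning in the answers above, here is a minimal sketch; it is not the authors' code. It assumes a hypothetical share of women in the candidate pool (`pool_share_women`) and a hypothetical number of laureates per year (`laureates_per_year`); neither value is taken from the study. If each laureate were drawn independently from such a pool, the chance of an all-male year would be (1 - share)^n, and a small Monte Carlo simulation should reproduce that figure.

```python
import random


def prob_all_men_exact(pool_share_women: float, laureates_per_year: int) -> float:
    """Chance that every laureate in one year is a man, assuming each
    laureate is drawn independently from a pool with the given share of women."""
    return (1.0 - pool_share_women) ** laureates_per_year


def prob_all_men_simulated(
    pool_share_women: float,
    laureates_per_year: int,
    trials: int = 100_000,
    seed: int = 0,
) -> float:
    """Monte Carlo estimate of the same probability."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    all_men_years = 0
    for _ in range(trials):
        # A draw below the women's share counts as a woman laureate.
        if all(rng.random() >= pool_share_women for _ in range(laureates_per_year)):
            all_men_years += 1
    return all_men_years / trials


if __name__ == "__main__":
    # Hypothetical inputs for illustration only: 10% women, 3 laureates.
    share, n = 0.10, 3
    print("exact:    ", prob_all_men_exact(share, n))
    print("simulated:", prob_all_men_simulated(share, n))
```

Comparing such a baseline with the observed share of all-male years is one way to ask whether the prize record is consistent with the gender ratio of the pool; the real analysis would need field-specific pool data for each period.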
climbed to an all-time high of nearly
2.5 million in 2018, the Centers for Disease
Control and Prevention (CDC) announced 
on 8 October. Cases of chlamydia grew 

by 3% from the 2017 level, to more than 
1.7 million, the most ever. Gonorrhea cases 
increased 5%, to 583,405, the highest since 
1991. And cases of early- and middle-stage 
syphilis, the most infectious stages of the 
disease, grew 14% to 35,063, also the most 
since 1991. Ninety-four newborns died 
from congenital syphilis, which is passed 
from mothers to babies during pregnancy, 
up 22%. All three diseases are readily 
treatable with antibiotics. CDC said the 
increases were caused in part by cuts to 
STD screening and treatment programs 
and decreased condom use. 


Untreated, the chlamydia bacterium can cause 
infertility in women. 


A bid to renew stem cell fund


RESEARCH BUDGETS | As California’s $3 
billion stem cell agency prepares to dole out 
the last research awards from its dwindling 
funds, the investment banker behind the 
2004 ballot initiative that created it has 
formally launched a new effort to keep it 
alive. As anticipated, Robert Klein, chair- 
man of the advocacy group Americans for 
Cures in Palo Alto, California, submitted a 
new ballot measure last week that would 
devote another $5.5 billion from bond sales 
to the California Institute for Regenerative 
Medicine (CIRM), which stopped accepting 
nearly all new award applications at the end 
of June. The measure includes proposals 

to increase public access to potential thera- 
pies generated by CIRM-funded research, 
including the creation of new centers 

to host clinical trials across California. If 
the petition can win the required 623,212 
signatures, it will appear on the ballot in 
November 2020. 


Ex-EPA air panel weighs in 


POLLUTION | The U.S. Environmental 
Protection Agency (EPA) should tighten 

its limits on particulate air pollution, an 
unusual group of 20 former agency science 
advisers has concluded. The experts had 
served on an official EPA advisory panel that
the agency disbanded last year, saying it was 
no longer needed. Critics said the dismissal 
was aimed at easing Trump administration 
efforts to weaken particulate limits, and 
some former panelists decided to indepen- 
dently continue their work with help from 
the Union of Concerned Scientists, an advo- 
cacy group in Cambridge, Massachusetts. 
After meeting on 10 and 11 October near 
Washington, D.C., the group expects to 
recommend that EPA cut its main fine- 
particulate limit from 12 micrograms per 
cubic meter (µg/m³) of air to between 8 and
10 µg/m³, E&E News reported. EPA

can ignore the advice, but the former advis- 
ers hope their work will bolster legal efforts 
to force the agency to impose tighter limits. 


Psychology studies criticized 


SCHOLARSHIP | An investigation by King’s 
College London has deemed 26 papers 

by the late Hans Eysenck, a prominent U.K. 
psychologist, “unsafe,” and the university 
has recommended they be retracted, The 
Guardian reports. Eysenck’s controversial 
theories, many formed in collaboration with 
German researcher Ronald Grossarth- 
Maticek in the mid-20th century, included 
his belief that people with certain personality 
types are more prone to cancer and heart 
disease, and that smoking might not actually 
be carcinogenic. Other experts criticized his 
publications, which were nevertheless widely 
cited—Eysenck ranked third in the social 
sciences citation index behind Sigmund 
Freud and Karl Marx. The investigation 

was prompted by a paper published in 
February by psychiatrist Anthony Pelosi of 
Priory Hospital in Glasgow, U.K., criticizing 
Eysenck’s work. 


Drexel settles bar bill case 


OVERSIGHT | An investigation by Drexel 
University found that its former head of 
electrical and computer engineering spent 
$189,062 of federal grant money at strip 
clubs and sports bars and for other non- 
work-related expenses over 10 years. Drexel, 
in Philadelphia, Pennsylvania, told the 
government that Chikaodinaka Nwankpa 
charged bills at Club Risque and other estab- 
lishments from 2007 to 2017 to eight federal 
grants for energy and naval technology 
research, the U.S. Attorney’s Office for the 
Eastern District of Pennsylvania announced 
7 October. In a settlement with the govern- 
ment, Drexel agreed to pay back the money; 
Nwankpa resigned his position, repaid the 
university $53,328, and was barred from fed- 
eral government contracting for 6 months, 
the U.S. Attorney’s Office said. 
The Biosphere 2 rainforest greenhouse, built in Arizona in the late 1980s, contains 90 plant species.


Drought test begins in Biosphere 2 rainforest 


Tracing carbon in closed ecosystem could reveal tree and soil response to water stress 


By Erik Stokstad 


Earlier this month, the doors to the
tropical rainforest, enclosed under
a ziggurat of glass, were sealed shut.
Christiane Werner turned a valve to
release about $12,000 worth of carbon
dioxide (CO₂) spiked with carbon-13,
an isotope that is normally scarce in the at- 
mosphere. The luxuriant plants inside Bio- 
sphere 2, a 30-year-old set of greenhouses 
and artificial ecosystems in the Arizona 
desert, soaked up the isotopic tracer, en- 
abling investigators to follow the flows of 
carbon through the healthy forest. Werner, 
an ecosystem physiologist at the University 
of Freiburg in Germany, and her team gath- 
ered these baseline data for the harsh test 
to come: the largest forest drought experi- 
ment ever monitored with isotopes. “It will 
be amazing to see the results,” says Tamir 
Klein, a plant ecophysiologist at the Weiz- 
mann Institute of Science in Rehovot, Is- 
rael, who is not involved. 

On 7 October, the researchers shut off 
the sprinklers that irrigate the rainforest, 
beginning a 6-week drought. Next month, 
they will inject another pulse of isotopically
enriched CO₂ into the densely instrumented
ecosystem, and apply other tracers.
A forest’s consumption of CO₂ slows during
drought, but scientists haven’t pinned down 
how thirsty rainforest plants—especially 
large trees—use and release their stored 
carbon. The answers are important for the 
global climate cycle, Klein says. Droughts,
expected to become more severe as the 
climate warms, could turn tropical forests 
from sinks of greenhouse gases into sources 
that accelerate climate change. 

Field experiments in the Amazon, in 
which plastic panels intercept rain to keep 
large swaths of forest dry, have sketched 
out how drought kills trees of different sizes 
(Science, 15 April 2005, p. 346). Smaller 
studies targeting individual plants with iso- 
topic tracers have revealed some of the im- 
pacts on plant function. But the Biosphere 
2 experiment will do both by applying trac- 
ers across an entire forest. “We have an eco- 
system in a lab,” Werner says. 

The $150 million Biosphere 2 was built 
in the late 1980s as a kind of spaceship on 
Earth, in which humans would attempt 
to survive inside a sealed ecosystem. That 
mission flopped, but the University of Arizona
now operates the facility for research,
education, and tourism. It has hosted large 
ecology studies and an ongoing $3 mil- 
lion experiment in landscape evolution 
(Science, 14 December 2012, p. 1417). Bio- 
sphere 2’s original funder, financier Edward 
Bass, helped support that earlier work, but 
much of the new experiment is funded by 
part of a €1.9 million grant Werner won 
from the European Research Council. About 
50 researchers from 13 institutions are con- 
tributing equipment and expertise. 

The focus is Biosphere 2’s tropical for- 
est, which includes some 90 plant species 
across an area the size of seven tennis courts. 
All summer, the team prepared by building 
canopy platforms where they could enclose 
dozens of leaves and stems in small chambers 
to capture their emissions. They drilled into 
tree trunks to insert probes, and dug obser- 
vation pits to measure emissions from soil 
and roots. Four kilometers of tubing carry 
gases from the probes to a room full of in- 
struments. “The scale of measurements on 
this drought is completely unparalleled,” says
co-leader Laura Meredith, a biogeochemist at 
the University of Arizona in Tucson and di- 
rector of rainforest research at Biosphere 2. 

By tracking the carbon-13, the research- 
ers will learn how quickly carbon is taken 
up during photosynthesis and then moves 
through the forest. They will compare 
those rates before and during the drought 
across six tree species that differ in their 
drought resistance. And they will learn how
the trees apportion stored carbon in their 
leaves, trunks, and roots. It’s a “huge black 
box,” and crucial for predicting how plants 
respond to stresses like drought, says plant 
physiologist William Anderegg of the Uni- 
versity of Utah in Salt Lake City. 

Another set of tracers will show in finer 
detail how particular metabolic pathways 
use carbon. During the past month, the 
researchers have supplied a solution of 
isotopically enriched pyruvate, a chemi- 
cal building block used in many biological 
processes, to leaves, roots, and clumps of 
soil. One type of pyruvate tracer reveals 
how much carbon is given off during day- 
time respiration—a key part of the carbon 
cycle that needs to be better quantified, 
Werner says. 

Another pyruvate tracer, taken up into a 
different pathway, shows how much carbon 
the plants and soil microbes use to synthe- 
size volatile organic compounds (VOCs). 
When plants are stressed, these chemicals 
make up a considerable fraction of their 
carbon emissions. They can warm the at- 
mosphere or turn into aerosols that cool it, 
but their overall climatic effect is unknown. 
Plants use VOCs for many purposes, includ- 
ing as a homing signal for a vast web of soil 
fungi that provide water and nutrients to 
roots during drought. The researchers hope 
to quantify rates and amounts of VOCs ex- 
changed between the microbes and plants 
and whether they change during drought. 

At the end of the drought, the research- 
ers will perform one last tracer experiment, 
irrigating the deep soil with water enriched 
in an isotope of hydrogen. They expect large 
trees to take up most of the water, and they 
hope to learn whether their deep root sys- 
tems will leak some of the water into the 
shallow soil, helping smaller plants recover. 

Finally, the sprinklers will turn on and 
return the ecosystem to normal. When 
parched soil and fallen leaves are rewetted, 
microbes go into metabolic overdrive and 
churn out CO₂ and VOCs. Meredith and her
colleagues will measure emissions and link 
them to patterns in microbial genes. 

Ultimately, results from the drought test 
will improve the way global climate models 
account for vegetation. “You need these ex- 
periments to unlock the physiology and add 
it into the models,” Anderegg says. “It gets 
us much more mechanistic and rigorous 
projections of how tropical trees and forests 
might respond to climate change.” 

After the experiment wraps up, tourists 
will be let into the rainforest again. But the 
canopy platforms will remain for future re- 
search, and some of the carbon tracers will 
also stick around. “We can look for the sig- 
nal for years to come,” Meredith says. 
NIH ‘high risk, high reward’
awardees skew male—again 


Agency struggles to attract female applicants, keep review 


process gender neutral 


By Meredith Wadman 


When the U.S. National Institutes
of Health (NIH) announces each 
year’s winners of plum “high risk, 
high reward” (HRHR) awards, 
which go to about 100 scientists 
doing outside-the-box research, 
the paucity of female winners regularly 
raises eyebrows. Despite NIH’s efforts to 
improve the picture, this year’s winners, an- 
nounced on 1 October, were no exception. 

Although women won three of the four 
types of HRHR awards in numbers that met 
or exceeded their representation in the ap- 
plicant pool, that representation was mea- 
ger: For all HRHR awards, 18% to 38% of 
applicants were women, although women 
have earned more than 50% of U.S. Ph.D.s 
in biological sciences since 2008. What’s 
more, for the Early Indepen- 
dence Awards (EIAs)—which 
can boost women’s careers 
at a crucial time by allowing 
new Ph.D.s to skip a postdoc 
and start an independent lab 
immediately—women constituted
38% of applicants, but
only 25% of awardees. 

The EIA sample size is small: 
Among the 13 new award- 
ees, three were women. Still, 
women have been underrepresented among 
EIA winners in eight of the 9 years since the 
award’s launch. “In aggregate over all the 
years, there has been a significant bias,” 
says Kristin Knouse, a 2018 EIA winner 
who is a cell biologist at the Whitehead 
Institute for Biomedical Research in Cam- 
bridge, Massachusetts. “There needs to be 
a systematic examination of where this 
awardee bias is arising.” 

But Olivia Corradin, also at the White- 
head Institute, who in 2017 won a similar 
early-investigator award for innovative re- 
search from NIH, cautions against reading 
too much into the data. “You could pull 
the same analysis for the [HRHR] Pioneer 
Award and you might conclude that they 
are overrepresenting women.” 

Women have won Pioneer Awards in numbers
that met or exceeded their representation
in the applicant pool in seven of the past
9 years. This year, 18% of applicants—and
45% of awardees—were women.

Published by AAAS

The low numbers of women applying for 
all four awards are the main concern, says 
James Anderson, who directs NIH’s Divi- 
sion of Program Coordination, Planning, 
and Strategic Initiatives, which supports 
the HRHR awards. “We need to focus on 
encouraging more women to apply,” he 
says. “We take this pretty seriously. We 
gotta change these numbers.” (For the two 
other types of HRHR awards this year, the 
Transformative Research and New Innova- 
tor awards, women were 20% and 30% of 
applicants, respectively, and won 24% and 
33% of awards, respectively.) 

In 2018, concerned about disparities 
among HRHR applicants, NIH Director 
Francis Collins convened a working group 
to review the program. Their 
final report, issued this past 
June, urged vigorous outreach
to institutions and populations
that historically have
not applied for HRHR awards 
in large numbers. The report 
also flagged a bias in favor of 
awards to scientists at high- 
profile institutions, and for 
proposals to study mechanistic 
and subcellular topics over, for 
instance, behavioral and clinical research. 

In April, NIH tweaked its call for 2020 ap- 
plications. “We strengthened our inclusivity 
language,” says HRHR Program Leader Ravi 
Basavappa, to make clear that the agency 
welcomes research from the whole spec- 
trum of institutions, all sorts of applicants, 
and any topics within its mission. 

Both the stubbornly low numbers of 
women applying for HRHR awards and 
their poor record at winning EIAs show 
“just how difficult change is,” says Molly 
Carnes, a working group member who is 
an expert on implicit bias and a professor 
of women’s health at the University of Wis- 
consin in Madison. “We live gendered lives. 
When you evaluate men and women’s sci- 
ence, their science is filtered through the 
fact that they are men and women. That’s 
how the human mind works.” 
Sauropods get a new diet and a new look


The behemoths may have chomped nutritious horsetails—with the help of sturdy beaks 


By John Pickrell, in Brisbane, Australia 


How did sauropod dinosaurs, the biggest
creatures ever to have thundered
across Earth, bulk up to the weight of 
more than 10 African bull elephants 
on a spartan diet of prehistoric 
greens? Many herbivores today grow 
fat on energy-rich grasses, but these and 
other nutritious flowering plants didn’t be- 
come common until near the end of the dino- 
saurs’ reign. Now, researchers think they have 
glimpsed the answer: a surprisingly nutrient- 
rich plant that could have been a mainstay 
of these dinosaurs’ diets, and turtlelike beaks 
that buttressed sauropods’ peglike teeth as 
they relentlessly stripped foliage from plants. 

“We are seeing that they were able to ac- 
quire large volumes of food rapidly, and 
possibly nutrient-rich food,” says Stephen
Poropat, a paleontologist at Swinburne Uni- 
versity in Melbourne, Australia. 

Researchers at the University of Bonn in 
Germany presented the findings last week at 
the annual meeting of the Society of Verte- 
brate Paleontology here. Their previous work 
helped show that sauropods were eating ma- 
chines that gulped down vegetation without 
chewing. They swung their long necks over 
vast areas, like prehistoric lawn mowers, 
while saving energy by keeping their bodies 
in one spot. The new studies add detail by 
exploring the beasts’ diet and jaw structure. 

One study, by Bonn’s Carole Gee and her 
colleagues, identified what may have been 
the superfood of the dinosaur era. Her team 
zeroed in on the nutritional content of low-growing,
spore-bearing horsetails, or Equisetum,
which were widespread during the
Jurassic period and still grow today. 

Horsetails appeared to be poor fodder 
in previous tests, which simply burned the 
plants to measure carbon content, Gee says. 
Instead, her team adapted the Hohenheim 
gas test, a method for assessing the quality 
of fodder for farm animals. They fermented 
modern horsetails for 3 days to simulate 
the journey through a sauropod’s gut and 
measured the volume of gas produced—an 
indicator of energy content. The research- 
ers were astounded to find that horsetails 
released more energy than any other plant 
group, including 16 modern grasses. Equi- 
setum is rich in protein, they say, and far 
more nutritious than the ferns, cycads, and 
conifers common in the dinosaur era. Gee 
argues that horsetails by rivers and lakes 
would have offered sauropods, especially 
young ones, “a plentiful, accessible, and ex- 
tremely nutritious food.” 

Even eating superfoods, sauropods must 
have vacuumed up as much as 1 ton or more 
of plant matter per day. To understand the 
mechanics, another Bonn team analyzed the 
beasts’ jaws. 

For decades, paleontologists have dug up 
puzzling fossils: rows of isolated sauropod 
teeth, still neatly arranged as they would have 
been in the mouth, with not a scrap of bone 
encasing them. “There must have been something
holding them in place,” Bonn’s Kayleigh
Wiersma says. “Otherwise they would have 
been scattered all around the dig site.” 

In some skulls that retain teeth, they seem 
to be almost falling out of the sockets. “You 
can’t have teeth exposed to that degree,” says 
Steve Salisbury, a paleontologist at the University
of Queensland here. “It seems likely
there would have been some tissue that en- 
closed the base at least.” 

Wiersma and research leader Martin 
Sander studied seven sets of tooth rows, with 
up to 40 teeth each, from sauropods includ- 
ing Diplodocus, Brachiosaurus, and Apato- 
saurus. Wiersma said at the meeting that 
paleontologists may have had the faces of 
these iconic dinosaurs all wrong: Instead of 
the lizardlike lips shown in artists’ concepts, 
the behemoths likely sported beaks as well as 
teeth, unlike any living animal. 

The researchers found that sauropod teeth 
typically show surface wear only about half- 
way down to the jaw. That suggests the teeth 
were once deeply embedded in a supporting 
structure. The researchers also found tiny 
pits in the surface of the jaw, perhaps indi- 
cating blood vessels to nourish beaks. Similar 
pits and foramina are seen in other dino- 
saur bones near inferred structures, like the 
sheaths covering horns, thought to have been 
made of keratin, which forms our nails and 
birds’ beaks and feathers. 

Poropat cautions that expanded gum tis- 
sue, rather than a beak, might have held the 
teeth in place. But bony scaffolding that sup- 
ported beaks is obvious in other dinosaurs, 
including Triceratops, Stegosaurus, and 
duck-billed hadrosaurs. Nevertheless, “I don’t 
think we would have expected sauropods to 
have beaks,” says Darren Naish, a paleontolo- 
gist at the University of Southampton in the 
United Kingdom. It’s a “whole new look.”


John Pickrell is a journalist in Sydney, Australia. 
Lithium-ion battery development takes Nobel


Three chemists honored for creating powerful, light, rechargeable cells 


By Robert F. Service 


You probably have the work of Nobel
Prize winners in your pocket. This
year’s Nobel Prize in Chemistry was
awarded last week to the pioneers of
the lithium-ion battery, an invention
that has become ubiquitous in the
wireless electronics that permeate mod- 
ern life: your phone, tablet, laptop, and 
perhaps even your car. Lighter and more 
compact than the rechargeable batteries 
that preceded them, lithium-ion batteries 
are now moving beyond gadgets to power 
homes, airplanes, and even the electric 
grid that feeds power to entire cities. 

Three chemists will split the $900,000 
prize: Stanley Whittingham at the State 
University of New York in Bing- 
hamton, John Goodenough at the 
University of Texas in Austin, and 
Akira Yoshino at Asahi Kasei Cor- 
poration in Tokyo. 

“I am thrilled,” says Yang Shao-Horn,
an electrochemist at the
Massachusetts Institute of Technol- 
ogy in Cambridge. “Lithium-ion 
batteries have made a tremendous 
impact on our society,” adds Kristin
Persson, a materials chemist at the
University of California, Berkeley.
“All three [awardees] deserved it. 
They provided critically important 
components to make lithium-ion 
batteries work.” 

Virtually all batteries have three 
essential components: two elec- 
trodes—an anode and a cathode— 
separated by an electrolyte. In to- 
day’s lithium-ion cells, the electrolyte is typi- 
cally a liquid that allows lithium ions to move 
back and forth between the electrodes. When 
the battery is providing electricity, lithium 
atoms at the anode give up electrons that 
flow through an external wire. The posi- 
tively charged lithium ions left behind mi- 
grate through the electrolyte to the cathode, 
where they nestle between the cathode’s 
layers of metal oxide materials. Applying an 
electric voltage to charge the battery reverses 
the flow, pushing the lithium ions out of the 
metal oxide, through the electrolyte, and back 
to the anode, where they retrieve electrons. 

In the 1970s, when Whittingham was 
working at Exxon, all rechargeable batteries 
used other, less efficient electrode materials: 
lead, nickel, cadmium. Whittingham tried
something new: pairing a cathode made of 
a layered material called tantalum disulfide 
(TaS2) with an anode made from lithium 
metal, which readily gives up electrons and 
forms ions that can wedge into the TaS2 lay- 
ers. The resulting battery could store lots of 
energy. But Exxon couldn’t commercialize it 
because repeated charging deposited thin 
whiskers of lithium, known as dendrites, on 
the anode, causing some batteries to short 
out and catch fire. 

Goodenough, then at the University of 
Oxford in the United Kingdom, picked up 
the mantle. He realized the cathode could 
soak up more electrons if it was made of a 
metal oxide instead of a metal sulfide. Some 
of those proved unstable. But he also found
that adding cobalt to the cathode helped
stabilize its layers. In 1980, he described
a rechargeable battery that could produce
4 volts, making it nearly twice as powerful
as Whittingham’s.


The 1970s oil crisis spurred interest in rechargeable batteries; today’s electric
car batteries use technology launched from the chemistry Nobelists’ ideas.

Still, dendrite formation on lithium 
metal anodes remained a major problem. 
Researchers had shown they could make 
anodes out of graphite instead, because 
lithium atoms could wedge themselves 
between the graphite’s carbon layers. But 
the graphite invariably broke down in the 
battery’s electrolyte. In the mid-1980s, 
Yoshino found that if he used an electro- 
lyte containing ethylene carbonate, a thin 
insulating layer formed on the graphite 
surface, protecting it from degradation. 


Published by AAAS 


“I just sort of sniffed out the direction
the trends were moving,” Yoshino says. By 
1991, Asahi Kasei and Sony had released 
the first commercial lithium-ion battery, 
which powered portable music players, 
among other devices. 

Those initial cells could store twice as 
much energy as the previous best recharge- 
able batteries. Further improvements have 
boosted the energy storage capacity of 
lithium-ion cells another threefold, notes 
George Crabtree, director of the Joint 
Center for Energy Storage Research at Argonne
National Laboratory in Lemont, Illinois.
“The progress has been absolutely
stunning,” he says. 

It’s likely not over. “There’s still plenty 
of room for improvement,” Persson says. 
For starters, battery research- 
ers are working feverishly to do 
away with cobalt, an expensive 
metal often mined by child labor- 
ers in the Democratic Republic of 
the Congo, which supplies some 
70% of the world’s total. Current 
lithium-ion batteries reduce the 
need for cobalt by mixing in man- 
ganese and nickel, but stabilizing 
the layered cathode still requires 
some cobalt. “We have to move 
away from that,” Persson says. 

Another major research thrust 
is replacing liquid electrolytes 
with polymers and other solid 
materials. If perfected, solid elec- 
trolytes could enable researchers 
to return to using solid lithium 
anodes without fear of creating 
dendrites. That could lead to bat- 
teries able to store enough energy to propel 
electric vehicles as far as gas-powered cars 
without recharging, power short-hop elec- 
tric planes, and even store backup electric- 
ity from wind and solar plants to light up 
cities at night and when the wind is still. 

Goodenough, 97, is the oldest person ever 
to receive a Nobel Prize. He was in London 
to accept another award when he got word. 
Asked during a press conference what he 
hopes for on his return to his lab in Texas, 
he said, “I hope they will still keep me em- 
ployed.” Whittingham, 77, has no plans to 
cut back his research either. “I’m going to 
follow John Goodenough’s example,” he 
says. “I’m still full steam ahead.” The same 
is certainly true for their creation. 
Scans could reveal where and how consciousness exists in the brain.


Rival theories face off over 
brain’s source of consciousness 


$20 million project puts competing ideas to the test 


By Sara Reardon 


Brain scientists can watch neurons fire
and communicate. They can map how
brain regions light up during sensation,
decision-making, and speech.
What they can’t explain is how all
this activity gives rise to consciousness.
Theories abound, but their advocates
often talk past each other and interpret the 
same set of data differently. “Theories are 
very flexible,” says Christof Koch, president
of the Allen Institute for Brain Science in 
Seattle, Washington. “Like vampires, they’re 
very difficult to slay.” 

Now, the Templeton World Charity Foun- 
dation (TWCF), a nonprofit best known 
for funding research at the intersection of 
science and religion, hopes to narrow the 
debate with experiments that directly pit 
theories of consciousness against each other. 
The first phase of the $20 million project, 
launched this week at the Society for Neuro- 
science meeting in Chicago, Illinois, will 
compare two theories of consciousness by 
scanning the brains of participants during 
cleverly designed tests. Proponents of each 
theory have agreed to admit it is flawed if 
the outcomes go against them. 

Head-to-head contests are rare in basic sci- 
ence. “It’s a really outlandish project,” says 
principal investigator Lucia Melloni, a neuro- 
scientist at the Max Planck Institute for Em- 
pirical Aesthetics in Frankfurt, Germany. But 
understanding consciousness has become 
increasingly important for researchers seeking
to communicate with locked-in patients,
determine whether artificial intelligence 
systems can become conscious, or explore 
whether animals experience consciousness 
the way humans do. To winnow the theories, 
TWCF took inspiration from a 1919 experiment
in which physicist Arthur Eddington
pitted Albert Einstein’s theory of general 
relativity against Isaac Newton’s gravitational 
theory. Eddington measured how the Sun’s 
gravity caused light from nearby stars to shift 
during a solar eclipse—and Einstein won. 

Consciousness, however, is much less eas- 
ily tested or mathematically defined than 
gravity. TWCF has identified a half-dozen 
leading consciousness theories and intends 
to eventually fund research to test them all. 
“What we hope for is a process that reduces 
the number of incorrect theories,” says 
foundation President Andrew Serazin, who 
is based in Nassau, Bahamas. “We want to 
reward people who are courageous in their 
work, and part of having courage is having 
the humility to change your mind.” 

The first two contenders are the global 
workspace theory (GWT), championed by 
Stanislas Dehaene of the Collège de France in
Paris, and the integrated information theory 
(IIT), proposed by Giulio Tononi of the Uni- 
versity of Wisconsin in Madison. The GWT 
says the brain’s prefrontal cortex, which con- 
trols higher order cognitive processes like 
decision-making, acts as a central computer 
that collects and prioritizes information from
sensory input. It then broadcasts the information
to other parts of the brain that carry
out tasks. Dehaene thinks this selection pro- 
cess is what we perceive as consciousness. By 
contrast, the IIT proposes that conscious- 
ness arises from the interconnectedness of 
brain networks. The more neurons interact 
with one another, the more a being feels 
conscious—even without sensory input. IIT 
proponents suspect this process occurs in 
the back of the brain, where neurons con- 
nect in a gridlike structure. 

To test the schemes, six labs will run ex- 
periments with a total of more than 500 par- 
ticipants, costing the foundation $5 million. 
The labs, in the United States, Germany, the 
United Kingdom, and China, will use three 
techniques to record brain activity as volun- 
teers perform consciousness-related tasks: 
functional magnetic resonance imaging, 
electroencephalography, and electrocortico- 
graphy (a form of EEG done during brain sur- 
gery, in which electrodes are placed directly 
on the brain). In one experiment, research- 
ers will measure the brain’s response when a 
person becomes aware of an image. The GWT 
predicts the front of the brain will suddenly 
become active, whereas the IIT says the back 
of the brain will be consistently active. 

Tononi and Dehaene have agreed to pa- 
rameters for the experiments and have reg- 
istered their predictions. To avoid conflicts 
of interest, the scientists will neither collect 
nor interpret the data. If the results appear 
to disprove one theory, each has agreed to 
admit he was wrong—at least to some extent. 

The project has drawn criticism, mostly 
because it includes the IIT. Anil Seth, a 
neuroscientist at the University of Sus- 
sex in Brighton, U.K., says the theory is too 
philosophical—attempting to explain why 
consciousness exists, rather than how the 
brain determines whether a stimulus is wor- 
thy of conscious attention—to be directly 
testable. “I don’t think [the competition] will 
do what it says on the tin,” he says. Tononi 
himself doubts the experiments could rule 
out all aspects of his theory, although he says 
they could “make our life more difficult.” 

Koch, who supports the IIT, disagrees 
with Seth’s assessment. “Of course it’s test- 
able,” he says. He points to clinical trials in 
which researchers stimulate the brains of 
people in minimally conscious states such 
as comas or under anesthesia, triggering 
brain activity patterns that suggest neurons 
interconnect in the way the IIT predicts. 

Despite his misgivings about the project’s 
prospect for a decisive answer, Seth says 
it will spark discussion and collaboration 
among scientific rivals. “That itself is to be 
applauded,” he says. 


Sara Reardon is a journalist in Bozeman, Montana. 
Texas cancer agency seeks new vote of approval


Ballot initiative for CPRIT would raise another $3 billion to fuel state’s cancer research 


By Jocelyn Kaiser 


Seven years ago, when Francesca Cole
was ending her postdoc at Memo- 
rial Sloan Kettering Cancer Center 
in New York City, she got a Texas-size 
offer: a $3 million startup package to 
join the University of Texas’s (UT’s) 
MD Anderson Cancer Center in Houston. 
Most of that sum came from the $3 billion 
Cancer Prevention & Research Institute of 
Texas (CPRIT) in Austin, an unusual state 
initiative that voters approved in 2007. 

On 5 November, Texas residents will de- 
cide whether to sustain CPRIT, the second 
largest public source of cancer funding in 
the United States after the fed- 
eral government. At stake is its 
generous support for 123 tenure- 
track faculty like Cole, who in- 
vestigates cancer and DNA re- 
pair. Most scientists use fruit 
flies or zebra fish, but she could 
afford to build a large team that 
probes DNA repair using more 
sophisticated—and expensive— 
animal models: 25 different ge- 
netically modified mice strains. 
“The CPRIT investment made a 
huge difference to my research 
success,” says Cole, who has 
since won a prestigious New 
Innovator Award from the Na- 
tional Institutes of Health. 

Texas residents appear likely 
to approve the ballot initiative, 
which would give CPRIT an- 
other $3 billion through bond 
sales; a recent poll found that two-thirds 
of voters support it. Yet some dissent re- 
mains from fiscal conservatives. State Sena- 
tor Charles Schwertner (R) told the Austin 
American-Statesman in January that al- 
though CPRIT’s goals are “unquestionably 
noble,” funding cancer research is not a
role for state government. He introduced a 
bill to have CPRIT become a self-sufficient 
agency, but it failed to advance. 

Still, CPRIT and its supporters aren’t let- 
ting up on their message that the agency 
has been a boon for Texas’s research efforts 
and the state’s economy. “The investment 
CPRIT has made has accelerated cancer 
research and the development of the cancer
ecosystem in Texas beyond our expectations,”
says the agency’s chief scientific
officer, James Willson. Geneticist Richard
Kolodner of the University of California, 
San Diego, who chairs CPRIT’s external 
scientific review council, says the initiative 
will yield an even bigger payoff in the fu- 
ture. “Ten years is a short period of time for 
a scientific program.” 

California’s $3 billion stem cell agency 
was CPRIT’s original inspiration, although 
California was responding to federal restric- 
tions on stem cell studies, whereas Texas 
wanted to bolster the state’s research efforts 
and its biotech industry. After a smooth first 
few years, a scandal broke out in 2012 over an
$18 million incubator award to MD Anderson
that had not undergone scientific peer
review. That, along with concerns that politics
was skewing grant decisions, prompted
CPRIT’s chief scientific officer, Nobel laureate
and biochemist Alfred Gilman, to resign
in protest, along with most of its scientific
council and many grant reviewers. After
more problems led to a 10-month hold on
new grants and a governance overhaul, the
agency got back on track.


State of cancer funding
Texas’s cancer funding agency has handed out most of its initial $3 billion, boosting
prevention services, academic research, and product development.

CPRIT has awarded more than $2.4 bil- 
lion for 1447 awards split among clinical 
and translational research, recruitment, 
basic research, training, and prevention. It 
has supported shared resources such as bio- 
informatics facilities and advanced micro- 
scopes. The agency touts its practical im- 
pact, saying 36 cancer companies have used 
its money to launch, grow, or move to the 
state. They, in turn, have raised more than
$3 billion from investors. A recent analysis 
commissioned by CPRIT concluded that the 
money it pumps into the economy gener- 
ates $1.4 billion in annual economic ac- 
tivity and supports 10,000 jobs. And one 
“immediate” result of CPRIT’s $250 mil- 
lion in prevention grants has been cancer 
screening and other services for 320,000 
Texans a year, Willson says. 

Kolodner highlights CPRIT’s support 
for recruiting 181 researchers as “the 
gift that keeps on giving.” They include 
heavyweights such as MD Anderson’s Jim 
Allison, who won a Nobel Prize last year for 
his work on immunotherapies, and stem 
cell researcher Sean Morrison 
of UT’s Southwestern Medical 
Center in Dallas. But one large 
investment misfired: Mouse 
geneticists Neal Copeland and 
Nancy Jenkins, recruited in 2011 
from Singapore to the Methodist
Hospital Research Institute
in Houston with a 5-year,
$29 million joint package—half 
from CPRIT—retired after that 
money ran out. The agency has 
since capped its share of such 
awards to $6 million in order to 
recruit more scientists. 

CPRIT says its recruits have 
drawn additional grant money 
from other sources. Yet Texas’s 
National Cancer Institute fund- 
ing, which totaled $249 mil- 
lion in 2018, has risen only 16% 
since 2008, below NCI’s overall
budget growth of 23% over the same de- 
cade. Willson counters that since CPRIT’s 
inception, NCI has tabbed two Texas sites as 
new Comprehensive Cancer Centers, which 
“catalyze something much greater than sum 
of the parts.” (MD Anderson was already a 
center.) Willson adds that many early-career 
CPRIT scholars are just now getting their 
first NCI grants, and he says funding from 
private research foundations has ballooned. 

CPRIT may still be an experiment, but 
Texas voters will almost certainly give it a 
chance to continue. That stands in contrast 
to California, where efforts to renew the 
California Institute for Regenerative Medi- 
cine face an uphill battle. Kolodner says, “I 
think sadly people probably relate to cancer 
much more than stem cells.” 
Pint-size proteins, found everywhere from venom
to muscles, can block or boost their bigger kin


By Mitch Leslie


Mice put human runners to
shame. Despite taking puny 
strides, the rodents can log 
10 kilometers or more per night 
on an exercise wheel. But the 
mice that muscle biologist Eric 
Olson of the University of Texas 
Southwestern Medical Center in 
Dallas and colleagues unveiled 
in 2015 stood out. On a treadmill, the mice 
could scurry up a steep 10% grade for about 
90 minutes before faltering, 31% longer than 
other rodents. Those iron mice differed from 
counterparts in just one small way—the re- 
searchers had genetically altered the animals 
to lack one muscle protein. That was enough 
to unleash superior muscle performance. “It’s 
like you’ve taken the brakes off,” Olson says.

Just as startling was the nature of the 
crucial protein. Muscles house some gar- 
gantuan proteins. Dystrophin, a structural 
protein whose gene can carry mutations 
that cause muscular dystrophy, has more 
than 3600 amino acids. Titin, which acts 
like a spring to give muscles elasticity, is 
the biggest known protein, with more than 
34,000 amino acids. The protein disabled in 
the mice has a paltry 46. Although research- 
ers have probed how muscles work for more 
than 150 years, they had completely missed 
the huge impact this tiny protein, called myo- 
regulin, has on muscle function. 

Olson and his colleagues weren’t the only 
ones to be blindsided by Lilliputian pro- 
teins. As scientists now realize, their initial 
rules for analyzing genomes discriminated 
against identifying those pint-size molecules. 
Now, broader criteria and better detection
methods are uncovering minuscule proteins
by the thousands, not just in mice, but in
many other species, including humans. “For
the first time, we are about to explore this
universe of new proteins,” says biochemist
Jonathan Weissman of the University of California,
San Francisco.


296 18 OCTOBER 2019 • VOL 366 ISSUE 6463

Biologists are just beginning to delve into 
the functions of those molecules, called micro- 
proteins, micropeptides, or miniproteins. 
But their small size seems to allow them to 
jam the intricate workings of larger proteins, 
inhibiting some cellular processes while un- 
leashing others. Early findings suggest micro- 
proteins bolster the immune system, control 
destruction of faulty RNA molecules, protect 
bacteria from heat and cold, dictate when 
plants flower, and provide the toxic punch 
for many types of venom. “There’s probably 
going to be small [proteins] involved in all
biological processes. We just haven’t looked
for them before,” says biochemist Alan
Saghatelian of the Salk Institute for Biological
Studies in San Diego, California.


The venom of this predatory water bug has
more than a dozen small proteins.

Small proteins also promise to revise the 
current understanding of the genome. Many 
appear to be encoded in stretches of DNA— 
and RNA—that were not thought to help build 
proteins of any sort. Some researchers spec- 
ulate that the short stretches of DNA could 
be newborn genes, on their way to evolving 
into larger genes that make full-size proteins. 
Thanks in part to small proteins, “We need 
to rethink what genes are,” says microbiologist
and molecular biologist Gisela Storz
of the National Institute of Child Health and 
Human Development in Bethesda, Maryland. 

Despite the remaining mysteries, scientists 
are already testing potential uses for the mol- 
ecules. One company sells insecticides de- 
rived from small proteins in the poison of an 
Australian funnel-web spider. And a clinical 
trial is evaluating an imaging agent based on 
another minute protein in scorpion venom, 
designed to highlight the borders of tumors 
so that surgeons can remove them more pre- 
cisely. Many drug companies are now search- 
ing for small proteins with medical potential, 
says biochemist Glenn King of the University 
of Queensland in St. Lucia, Australia. “It’s one 
of the most rapidly growing areas.” 


OTHER SHORT AMINO ACID chains, often called 
peptides or polypeptides, abound in cells, 
but they are pared-down remnants of bigger 
predecessors. Myoregulin and its diminutive 
brethren, in contrast, are born small. How 
tiny they can be remains unclear. Fruit flies 
rely on a microprotein with 11 amino acids
to grow normal legs, and some microbes may 
crank out proteins less than 10 amino acids 
long, notes microbial genomicist Ami Bhatt 
of Stanford University in Palo Alto, Califor- 
nia. But even the largest small proteins don’t 
measure up to average-size proteins such as 
alpha amylase, a 496-amino-acid enzyme in 
our saliva that breaks down starch. 

Few small proteins came to light until re- 
cently because of a criterion for identifying 
genes set about 20 years ago. When scientists 
analyze an organism’s genome, they often 
scan for open reading frames (ORFs), which 
are DNA sequences demarcated by signals 
that tell the cell’s ribosomes, its protein- 
making assembly lines, where to start and 
stop. In part to avoid a data deluge, past re- 
searchers typically excluded any ORF that 
would yield a protein smaller than 100 amino 
acids in eukaryotes or 50 amino acids in bac- 
teria. In yeast, for example, that cutoff lim- 
ited the list of ORFs to about 6000. 
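
The effect of such a length cutoff can be illustrated with a minimal sketch. It is not the pipeline any of the researchers used: it scans only the forward strand, treats ATG as the sole start signal, and uses the standard stop codons. The function name find_orfs, the min_aa parameter, and the toy sequence are all hypothetical, chosen only for illustration.

```python
# Illustrative sketch only: forward strand, ATG starts, standard stop codons.
# find_orfs, min_aa, and the toy genome are hypothetical names and data.
STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODON = "ATG"


def find_orfs(dna, min_aa=1):
    """Return sorted (start, end, n_amino_acids) tuples for forward-strand ORFs.

    An ORF here runs from the first ATG after the previous stop, in a given
    reading frame, to the next in-frame stop codon. n_amino_acids counts the
    initial methionine but not the stop codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None:
                if codon == START_CODON:
                    start = i
            elif codon in STOP_CODONS:
                n_aa = (i - start) // 3
                if n_aa >= min_aa:  # the size cutoff described in the text
                    orfs.append((start, i + 3, n_aa))
                start = None
    return sorted(orfs)


# Toy sequence with one 11-amino-acid ORF and one 121-amino-acid ORF.
short_orf = "ATG" + "GCT" * 10 + "TAA"
long_orf = "ATG" + "GCT" * 120 + "TGA"
genome = "CC" + short_orf + "C" + long_orf

print(find_orfs(genome, min_aa=100))  # the traditional eukaryote cutoff
print(find_orfs(genome, min_aa=5))    # a relaxed cutoff
```

With the 100-amino-acid cutoff, only the longer ORF in the toy sequence is reported; lowering the threshold brings the 11-amino-acid ORF into view as well, which is the kind of shift the relaxed criteria produce on real genomes.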

Relaxing that criterion reveals that cells 
carry vastly more ORFs. Earlier this year, 
Stanford postdoc Hila Sberro Livnat, Bhatt, 
and colleagues trawled genome fragments 
from the microbes that inhabit four parts 
of the human body, including the gut and 
skin. By searching for small ORFs that could 
encode proteins between five and 50 amino 
acids long, the researchers identified about 
4000 families of potential microproteins. Al- 
most half resemble no known proteins, but 
the sequence for one small ORF suggested 
that a corresponding protein resides in 
ribosomes—a hint that it could play some 
fundamental role. “It’s not just genes with esoteric
functions that have been missed” when
scientists overlooked small ORFs, Bhatt says.
“It’s genes with core functions.”


Myoregulin is small compared with a typical protein and
minute next to a behemoth such as titin.

Other cells also house huge numbers of 
short ORFs—yeast could make more than 
260,000 molecules with between two and 
99 amino acids, for example. But cells al- 
most certainly don’t use all those ORFs, and 
some of the amino acid strings they produce 
may not be functional. In 2011, after finding 
more than 600,000 short ORFs in the fruit 
fly genome, developmental geneticist Juan 
Pablo Couso of the University of Sussex in 
Brighton, U.K., and colleagues tried to whit- 
tle down the number. They reasoned that if 
a particular ORF had an identical or near- 
identical copy in a related species, it was less 
likely to be genomic trash. After searching 
another fruit fly’s genome and analyzing 
other evidence that the sequences were be- 
ing translated, the group ended up with a 
more manageable figure of 401 short ORFs 
likely to yield microproteins. That would 
still represent a significant fraction of the in- 
sects’ protein repertoire—they harbor about 
22,000 full-size proteins. 

Weissman and colleagues found microproteins
a second way, through a method
they invented to broadly determine which 
proteins cells are making. To fashion any 
protein, a cell first copies a gene into mes- 
senger RNA. Then ribosomes read the mRNA 
and string together amino acids in the order 
it specifies. By sequencing mRNAs attached 
to ribosomes, Weissman and his team pin- 
point which ones cells are actually turning 
into proteins and where on the RNAs a ribo- 
some starts to read. In a 2011 Cell study, he 
and his team applied that ribosome profil- 
ing method, also called Ribo-seq, to mouse 
embryonic stem cells and discovered the 
cells were making thousands of unexpected 
proteins, including many that would fall be- 
low the 100-amino-acid cutoff. “It was quite 
clear that the standard understanding had 
ignored a large universe of proteins, many of 
which were short,” Weissman says.

Saghatelian and his colleagues adopted a 
third approach to discover a trove of micro- 
proteins in our own cells. The researchers 
used mass spectrometry, which involves 
breaking up proteins into pieces that are 
sorted by mass to produce a distinctive spectrum
for each protein. Saghatelian, his then-postdoc
Sarah Slavoff, and colleagues applied
the method to protein mixtures from human 
cells and then subtracted the signatures of 
known proteins. That approach revealed 
spectra for 86 previously undiscovered tiny 
proteins, the smallest just 18 amino acids 
long, the researchers reported in 2013 in Na- 
ture Chemical Biology. 


BEING SMALL LIMITS a protein’s capabilities. 
Larger proteins fold into complex shapes 
suited for a particular function, such as
catalyzing chemical reactions. Proteins smaller
than about 50 to 60 amino acids probably 
don’t fold, says chemist Julio Camarero of 
the University of Southern California in Los 
Angeles. So they probably aren’t suited to be 
enzymes or structural proteins. 

However, their diminutive size also opens 
up opportunities. “They are tiny enough to 
fit into nooks and crannies of larger proteins 
that function as channels and receptors,” 
Olson says. Small proteins often share short 
stretches of amino acids with their larger 
partners and can therefore bind to and alter 
the activity of those proteins. Bound
microproteins can also shepherd bigger molecules
to new locations—helping them slip into cell 
membranes, for instance. 

Because of their attraction to larger pro- 
teins, small proteins may give cells a revers- 
ible way to switch larger proteins on or off. 
In a 2016 study in PLOS Genetics, plant de- 
velopmental biologist Stephan Wenkel of the 
University of Copenhagen and colleagues 
genetically altered Arabidopsis plants to pro- 
duce extra amounts of two small proteins. 
The plants normally burst into flower when 
the days are long enough, but when they 
overproduced the two microproteins, their 
flowering was postponed. The small proteins 
caused that delay by blocking a hefty protein 
called CONSTANS that triggers flowering. 
They tether CONSTANS to other inhibitory 
proteins that shut it down. “A cell uses things 
that help it survive. If a short protein does the 
job, that’s fine,” Saghatelian says. 

Those jobs include other key tasks. In 2016, 
Slavoff, Saghatelian, and colleagues revealed 
that human cells manufacture a 68-amino- 
acid protein they named NoBody that may 
help manage destruction of faulty or un- 
needed mRNA molecules. NoBody’s name 
reflects its role in preventing formation of 
processing bodies (P-bodies), mysterious 
clusters in the cytoplasm where RNA break- 
down may occur. When the protein is miss- 
ing, more P-bodies form, thus boosting RNA 
destruction and altering the cell’s internal 
structure. “It shows that small proteins can 
have massive effects in the cell,” Slavoff says.

Muscles appear to depend on a variety of 
microproteins. During embryonic develop- 
ment, individual muscle cells merge into 
fibers that power contraction. The 84-amino- 
acid protein myomixer teams up with a larger 
protein to bring the cells together, Olson’s 
team reported in 2017 in Science. Without it, 
embryonic mice can’t form muscles and are 
almost transparent. 

Later in life, myoregulin steps in to help 
regulate muscle activity. When a muscle re- 
ceives a stimulus, cellular storage depots spill 
calcium, triggering the fibers to contract and 
generate force. An ion pump called SERCA 
then starts to return the calcium to storage, 
allowing the muscle fibers to relax. Myoregu- 
lin binds to and inhibits SERCA, Olson’s team 
found. The effect limits how often a mouse’s 
muscles can contract—perhaps ensuring 
that the animal has muscle power in reserve
for an emergency, such as escaping a preda- 
tor. Another small protein, DWORF, has the 
opposite effect, unleashing SERCA and en- 
abling the muscle to contract repeatedly. 

Even extensively studied organisms such 
as the intestinal bacterium Escherichia coli 
harbor unexpected small proteins that have 
important functions. Storz and her team re- 
ported in 2012 that a previously undiscovered 
49-amino-acid protein called AcrZ helps the 
microbe survive some antibiotics by stimulat- 
ing a pump that expels the drugs. 

And the venom produced by a variety 
of organisms—including spiders, 
centipedes, scorpions, and poi- 
sonous mollusks—teems with tiny 
proteins. Many venom components 
disable or kill by blocking the chan- 
nels for sodium or other ions that 
are necessary for transmission of 
nerve impulses. Small proteins “hit 
these ion channels with amazing 
specificity and potency,” King says.
“They are the major components 
of venoms and are responsible for 
most of the pharmacological and 
biological effects.” 

Australia’s giant fish-killing water 
bug (see p. 296), for instance, doesn’t 
just rely on sharp claws and lancelike 
mouthparts to subdue prey. It injects 
its victims with a brew of more than 130 pro- 
teins, 15 of which have fewer than 100 amino 
acids, King and colleagues reported last year. 


UNLIKE HULKING PROTEINS such as anti- 
bodies, microproteins delivered by pill or 
injection may be able to slip into cells and 
alter their functions. Captopril, the first of a 
class of drugs for high blood pressure known 
as angiotensin-converting enzyme inhibitors,
was developed from a small protein in the 
venom of a Brazilian pit viper. But the drug, 
which the Food and Drug Administration ap- 
proved for sale in the United States in 1981, 
was discovered by chance, before scientists 
recognized small proteins as a distinct group. 
So far, only a few microproteins have reached 
the market or clinical trials. 

Cancer researchers are trying to capitalize 
on a microprotein in the poison of the death- 
stalker scorpion (Leiurus quinquestriatus) of 
Africa and the Middle East. The molecule has 
a mysterious attraction to tumors. By fusing
it to a fluorescent dye, scientists hope to il- 
luminate the borders of brain tumors so that 
surgeons can safely cut out the cancerous tis- 
sue. “It lights up the tumor. You can see the 
margins and if there are any metastases,” 
King says. A clinical trial is now evaluating 
whether the dual molecule can help surgeons 
remove brain tumors in children. 

How important small proteins will be for 
medicine is still unknown, but they have 
already upended several biological assumptions.
Geneticist Norbert Hübner of the
Max Delbrück Center for Molecular Medicine
in Berlin and colleagues found dozens
of new microproteins in human heart cells.
The group traced them to an unexpected
source: short sequences within long noncoding
RNAs, a variety that was thought not to
produce proteins. After identifying 169 long
noncoding RNAs that were probably being
read by ribosomes, Hübner and his team
used a type of mass spectrometry to confirm
that more than half of them yielded microproteins
in heart cells, a result reported earlier
this year in Cell.

Bacteria such as Escherichia coli also churn out many microproteins,
although their functions remain unclear in many cases.

The DNA sequences for other tiny pro- 
teins also occur in unconventional loca- 
tions. For example, some lie near the ORFs 
for bigger proteins. Researchers previously 
thought those sequences helped manage the 
production of the larger proteins, but rarely 
gave rise to proteins themselves. Some cod- 
ing sequences for recently discovered micro- 
proteins are even nested within sequences 
that encode other, longer proteins. 

Those genomic surprises could illuminate
how new genes arise, says evolutionary
systems biologist Anne-Ruxandra
Carvunis of the University of Pittsburgh 
in Pennsylvania. Researchers had thought 
most new genes emerge when existing 
genes duplicate or fuse, or when species 
swap DNA. But to Carvunis, microproteins 
suggest protogenes can form when mu- 
tations create new start and stop signals 
in a noncoding portion of the genome. If 
the resulting ORF produces a beneficial 
protein, the novel sequences would re- 
main in the genome and undergo natural 
selection, eventually evolving into 
larger genes that code for more 
complex proteins. 

In a 2012 study, Carvunis, who 
was then a postdoc in the lab of 
Marc Vidal at the Dana-Farber
Cancer Institute in Boston, and 
colleagues found that yeast trans- 
late more than 1000 short ORFs 
into proteins, implying that these 
sequences are protogenes. In a 
new study, Carvunis and her team 
tested whether young ORFs can be 
advantageous for cells. They genet- 
ically altered yeast to boost output 
of 285 recently evolved ORFs, most 
of which code for molecules that 
are smaller than the standard protein
cutoff or just over it. For almost 10% of the 
proteins, increasing their levels enhanced 
cell growth in at least one environment. 
The results, posted on the preprint server 
bioRxiv, suggest these sequences could 
be on their way to becoming full-fledged 
genes, Carvunis says. 

Slavoff still recalls being astonished 
when, during her interview for a postdoc 
position with Saghatelian, he asked whether 
she would be willing to go hunting for small 
proteins. “I had never thought that there 
could be this whole size of proteins that was 
dark to us until then.” 

But the bet paid off—she now runs her 
own lab that is searching for microproteins. 
Recently, she unleashed some of her post- 
docs and graduate students on one of the 
most studied organisms, the K12 strain of 
E. coli. The team soon uncovered five new
microproteins. “We are probably only 
scratching the surface,” she says. 

EVOLUTION


Evolutionary history 
of tissue bending 


Newly discovered unicellular eukaryote shows 
hallmark feature of animal morphogenesis 


By Pavel Tomancak 


Animal embryos shape their tissues
during development through a variety
of mechanisms, one of which involves
coordinated constriction of one side of
a sheet of cells, leading to tissue bending.
This apical constriction mechanism
of tissue morphogenesis occurs in most
animal groups (1), suggesting that it was
inherited from a common ancestor. On page
326 of this issue, Brunet et al. (2) describe the
remarkable discovery of a new species of
colony-forming unicellular eukaryote that uses
collective cell contraction to change its
morphology and behavior in response to a
lack of light. Because the new species belongs
to the closest living relatives of animals, the
choanoflagellates (3), these results cast new
light on the evolutionary origin of collective
cell contractility.

In a pond above the tidal line on a beach 
on the Caribbean island of Curaçao, Brunet
et al. found a remarkable creature com- 
posed of ~100 cells with whip-like append- 
ages (flagella) that together formed a small 
sheet. The sheet was bent inward, with the 
flagella pointing toward the interior of a 
cup-shaped colony. However, occasionally 
the sheet would invert its curvature rapidly, 
and within 30 s all the flagella would be fac- 
ing outward along the radius of the inverted 
colony (see the image). This change in tis- 
sue morphology is similar to developmen- 
tal morphogenesis in complex multicellular 
animals, in which epithelial sheets fold and 
invaginate to form multilayered structures. 


Max Planck Institute of Molecular Cell Biology and Genetics, 
Dresden, Germany. Email: tomancak@mpi-cbg.de 

Brunet et al. studied the creature and
on the basis of the morphology of the cells 
making up the sheet concluded that it was 
a choanoflagellate. Many researchers study 
these unicellular eukaryotes that some- 
times form multicellular colonies (4), owing 
to their distinct evolutionary relatedness to 
animals. What Brunet et al. found was a new 
species of choanoflagellate, named by the 
authors Choanoeca flexa. The inversion be- 
havior is specific to C. flexa but has been ob- 
served before. In 1983, a study of the closely 
related Choanoeca perplexa mentions such 
inversion behavior (5); however, that colony
did not thrive in the laboratory and was
lost and never studied again. Brunet et al. 
managed to keep their C. flexa alive in the 
laboratory by growing it together with the 
bacteria it fed on in the pond in which it 
was discovered. What followed is a triumph 
of unleashing the power of modern biology 
on a newly discovered species. 

The authors found that the inversion 
behavior is triggered by switching off the 
light. Analyses of the C. flexa genome revealed
a gene, encoding the protein rhodopsin
phosphodiesterase (6), that could
both sense light and pass the information
to other cellular processes by modifying
cyclic nucleotides that are commonly used
in intracellular signaling. This gene was a
good candidate to mediate the observed
light-dependent behavior, but how could
this be proven in a newly discovered species
in which gene expression cannot be
manipulated? Brunet et al. designed an elegant
experiment to address this question.

Myosin-mediated morphogenesis in choanoflagellates and animals
In choanoflagellates, single cells or sheets of connected cells contract because of actomyosin activity. This
behavior is triggered by darkness and results in a change in morphology (opening of collar microvilli and
sheet inversion) and behavior (from feeding to swimming). In animal embryos, local myosin-induced apical
constriction leads to epithelial tissue invagination during gastrulation.

Photodetection with rhodopsin requires a 
cofactor, a chromophore called retinal (7). C.
flexa does not have the genes necessary to
produce retinal. Therefore, it must be getting 
retinal from food, the bacteria. The authors 
fed their choanoflagellates with bacteria that 
can or cannot produce retinal and found that 
only colonies fed on retinal-producing bacte- 
rial strains can invert in darkness. Moreover, 
addition of synthetic retinal alone triggered 
the inversion. But what does inversion do? 
The authors showed that when the cell col- 
ony has its flagella facing inward, it can eat 
better by capturing the bacteria from the 
surrounding medium. However, when it in- 
verts, the flagella are oriented in a way that 
is more useful for moving the colony around, 
potentially to escape a large predator cast- 
ing a shadow over the colony. Therefore, the 
sheet inversion mediates a trade-off between 
feeding and swimming. 

Having established the regulation and 
ecological relevance of the inversion behav- 
ior, Brunet et al. examined the mechanism of 
inversion. They noticed from microscopy ob- 
servations that the individual cells of C. flexa 
hold together by means of contacts between 
cellular protrusions surrounding the flagel- 
lum that are called collar microvilli. This is 
a very specific way of holding a sheet of cells
together, quite distinct from how epithelial
sheets of animal cells are connected by
junctional complexes. However, C. flexa
shares a remarkable molecular feature with 
animal epithelia: At the base of the collar, 
there is a prominent actomyosin ring, a cy- 
toskeletal feature that is associated with cel- 
lular contractility in animal cells (8). During 
inversion, the ring contracts and the collar 
opens up. Chemical inhibitors of actomyosin 
contractility abolish this behavior. Therefore, 
the inversion behavior of choanoflagellates 
uses the same actomyosin molecular ma- 
chinery deployed by animal cells to sculpt 
tissues during development through apical 
constriction (see the figure). 

Myosin protein sequences are highly simi- 
lar between choanoflagellates and animals 
(9), and therefore, the myriad of molecular 
reagents developed in laboratories studying 
myosin in animal cells can be used in choano- 
flagellates. Brunet et al. used these reagents 
to visualize myosin distribution in several 
species across the choanoflagellate evolution- 
ary tree and found that they all have the myo- 
sin rings observed in C. flexa. 

Therefore, the study of Brunet et al. indi- 
cates that contractile actomyosin machin- 
ery was inherited by choanoflagellates and 
animals from a common ancestor. Animals 
have evolved ways to use this contractile ma- 
chinery to bend epithelial tissues by means 
of apical constriction—for example, during 
gastrulation. In choanoflagellates such as C. 
flexa, the machinery mediates collective cell 
inversion behavior in cell colonies. In ani- 
mals, the contractility is regulated through 
mechanochemical signals and feedback during
morphogenesis (10), whereas in choanoflagellates,
the contractility is directly linked
to environmental stimuli such as absence
of light and the decisions between feeding
and predator-escaping motile behaviors.
Actomyosin contractility is therefore an ancient
morphogenetic module (11) that has
been, most likely independently, co-opted
to achieve diverse morphogenetic outcomes
under vastly different regulation in animals
and choanoflagellates.

The successive steps of the inversion process.
The flagella are at the apical pole of each
cell (green) where the collar of microvilli (red),
which drive inversion, are also located.

The new species was discovered only a 
year ago, and yet Brunet et al. were able to 
convincingly connect the molecular mecha- 
nisms of actomyosin contraction to the col- 
lective inversion behavior and its ecological 
importance. This shows that comprehensive 
mechanistic studies are no longer confined to 
established organismal models such as fruit
fly, zebrafish, or mouse. On the contrary, the
study by Brunet et al. highlights the power
of looking at biodiversity to uncover new 
biological principles. Their approach of map- 
ping fundamental morphogenetic modules 
across the tree of life promises future insight 
into how organismal shapes evolve. 


REFERENCES AND NOTES 


1. A. C. Martin, B. Goldstein, Development 141, 1987 (2014).
2. T. Brunet et al., Science 366, 326 (2019).
3. N. King et al., Nature 451, 783 (2008).
4. B. S. C. Leadbeater, The Choanoflagellates: Evolution,
   Biology and Ecology (Cambridge Univ. Press, 2015).
5. B. S. C. Leadbeater, J. Mar. Biol. Assoc. 63, 135 (1983).
6. K. Yoshida et al., J. Biol. Chem. 292, 7531 (2017).
7. O. P. Ernst et al., Chem. Rev. 114, 126 (2014).
8. J. M. Sawyer et al., Dev. Biol. 341, 5 (2010).
9. T. A. Richards, T. Cavalier-Smith, Nature 436, 1113 (2005).
10. E. Hannezo, C.-P. Heisenberg, Cell 178, 12 (2019).
11. S. A. Newman, Philos. Trans. R. Soc. London Ser. B Biol.
    Sci. 371, 20150443 (2016).


10.1126/science.aaz1289 


18 OCTOBER 2019 + VOL 366 ISSUE 6463 301 


INSIGHTS | PERSPECTIVES 


IMMUNOTHERAPY 


Mobilizing unconventional T cells 


Modulating unconventional antigen presentation could
treat infections and cancer


By Tom H. M. Ottenhoff and Simone A. Joosten 


The human leukocyte antigen-E (HLA-E)
molecule is expressed on all nucleated
cells and presents self-peptides.
The resulting complexes can bind to
inhibitory receptors on immune killer
cells, including ~50% of natural killer
(NK) cells and a subset (~5%) of CD8+ cytotoxic
T lymphocytes (CTLs). Interruption of
this inhibitory axis, which serves as an immune
checkpoint, can improve effector functions
of both CTLs and NK cells and enhance
antitumor activity (1). HLA-E antigen expression
can also be used as a mechanism for
pathogen-infected cells to avoid being killed
by NK cells. However, HLA-E can present
peptides from pathogens and tumor cells to
so-called unconventional CD8+ T cells, which
can then be mobilized to fight the infection
or tumor. Recent advances in understanding
this dichotomy between unconventional T
cell activation and NK cell suppression reveal
potential preventive and therapeutic applications
in infectious diseases and cancer.

HLA-E is an HLA class Ib molecule and is 
classified as nonclassical because it is rela- 
tively monomorphic. In contrast to classical 
HLA class Ia molecules, which are highly 
polymorphic, the HLA-E locus encodes
only two functional variants, HLA-E*0101
and HLA-E*0103. These two proteins differ
by one amino acid (Arg or Gly, respectively),
which is located outside the
HLA-E peptide binding groove and is 
therefore considered unlikely to influence 
HLA-E-peptide interactions directly (2, 3). 
Although the two variants are largely similar, there are slight
differences in bound-peptide repertoires,
peptide-binding affinities, the level of ex- 
pression, and the stability of both HLA-E 
molecules for reasons that are poorly un- 
derstood (2). Both variants are maintained 
at comparable frequencies in the human 
population, potentially reflecting a lack of 
selective advantage for either variant. 

HLA class Ia expression is often down-regulated
in human tumors, which facilitates
immune evasion from classical CD8+
CTL-mediated killing. However, HLA class
Ia down-regulation can make tumors susceptible
to NK cell-mediated killing because
HLA class Ia is the main supplier of
peptides that can bind to, and thus maintain,
cell surface expression of HLA-E. It
has become clear, though, that when HLA
class Ia is absent, new tumor peptides can
be presented by HLA-E to activate unconventional
CD8+ T cells through HLA-E-peptide-specific
recognition by T cell receptors
(TCRs). There are several types of
unconventional T cells, which recognize
antigens through TCRs by means of monomorphic
antigen-presentation molecules
(3). In the case of HLA-E, the unconventional
CD8+ T cells reported thus far can
have both cytotoxic and suppressive properties
(4, 5). In mice, Qa-1 (the mouse ortholog
of HLA-E)-restricted unconventional
CD8+ T cells can confer antitumor activity
and improve survival (6, 7). These findings
inspired the search for potential peptides
in other diseases, including those caused
by persistent pathogen infections, such as
human immunodeficiency virus (HIV),
cytomegalovirus (CMV), and Mycobacterium
tuberculosis (Mtb). This resulted in the
identification of multiple peptides that are
recognized by unconventional HLA-E-restricted
CD8+ T cells and that were derived
from Mtb, Epstein-Barr virus (EBV), HIV,
and Salmonella typhi (2, 3), highlighting
the importance of HLA-E antigen presentation
in the activation of unconventional
CD8+ T cells.

Peptides from Mtb, HIV, and CMV in 
complex with HLA-E*0103 were recently 
crystallized. The resulting structures 
showed canonical anchoring of peptides 
into the peptide-binding groove (8) and re- 
vealed how peptides might be designed to 
activate HLA-E-restricted T cells for vac- 
cination. Three TCR molecules have been 
co-crystallized with their cognate HLA-E- 
bound peptide, revealing canonical contact 
sites (9). Further investigation is required 
to understand which structural features 
restrict TCRs to HLA-E-peptide complexes. 

Recent studies in nonhuman primates
(NHPs) revealed the importance of HLA-E-restricted
T cells in immune responses to
infectious diseases. Simian immunodeficiency
virus (SIV; the NHP equivalent to
HIV) antigens were expressed in modified
rhesus CMV (RhCMV) vectors as a subunit-vaccination
strategy, because CMV viruses
are known to induce strong and long-lasting
CD8+ memory T cell responses (2).
A subunit vaccine elicits immunological
memory through exposure to selected immunogenic
components from a tumor or
pathogen. Administration of this RhCMV-SIV
subunit vaccine to NHPs resulted in
protection against subsequent SIV infection,
which was mediated through unconventional
CD8+ T cells that recognized
SIV peptides bound to either MHC class II
molecules (which are expressed by antigen-presenting
cells and present peptides from
extracellular proteins) or MHC-E (the ortholog
of HLA-E), but not conventional
MHC class Ia molecules. In over half of
the animals, this resulted in eradication of
experimentally induced SIV infection (10).
Similarly designed RhCMV-TB-antigen
vectors also induced complete protection
against experimental tuberculosis in 41%
of treated NHPs. In these animals, equivalent
protection could also be achieved with
vectors that induced conventional CD8+ T
cells and CD4+ T helper cells, suggesting
redundancy in unconventional and conventional
CD8+ T cell responses in the NHP-tuberculosis
model (11). Regardless of the
many unresolved questions in HLA-E biology,
these data collectively support the candidacy
of HLA-E as a targetable pathway for
vaccination as well as immunotherapy, for
example, by antibody-mediated blockade
of CD8+ CTL or NK cell-expressed inhibitory
receptor molecules, one of which is
NK group 2A (NKG2A) (1). In addition to
the relative monomorphism of HLA-E, an
advantage of HLA-E-based vaccines over
traditional vaccine strategies targeting HLA
class Ia molecules is that HLA-E expression
is not down-regulated when HIV and Mtb
infection co-occur, which is an important
global health issue.

There is much more to be understood 
about the immunology of HLA-E and how 
this can be translated into vaccines and 
immunotherapies. There is limited under- 
standing of which peptides are optimal 
targets for HLA-E-restricted T cells. Developing
new tools to identify these from
pathogen and tumor genomic sequences
would be valuable. Moreover, exactly how
and where peptide antigens are processed 
intracellularly for HLA-E presentation is 
largely unknown. Studies in mice suggest 
a typical endoplasmic reticulum peptide- 
loading pathway for Qa-1 (7). However, in 
the case of Mtb infection, and likely other 
intracellular pathogens, HLA-E can be ex- 
pressed in the phagosome, suggesting an 
alternative site of HLA-E peptide loading 
(3, 7). Understanding the biology of HLA-E 
antigen presentation will be key to the 
design of optimal strategies to target this 
pathway for unconventional CD8+
T cell activation.

Another avenue for further investigation
is the diversity of the TCR repertoire for
HLA-E-presented ligands. For HLA-E-CMV
peptides, preferential usage of the
TCR β-chain variable region (Vβ16)
has been reported, and mouse tumor
models suggest a role for semi- [...]
(12, 13). An unanswered question
is whether HLA-E-restricted unconventional
T cells display narrow
TCR repertoires similar to those in
the mouse, or broader TCR repertoires,
and whether these differ according
to disease. It also remains
unclear how the HLA-E-restricted
TCR repertoire is selected in naive
T cells in the thymus.

Human leukocyte antigen-E in immunity
Under homeostatic conditions, HLA-E presents self-peptides and
prevents NK cell-mediated lysis through the CD94-NKG2A axis, thus
regulating innate immunity. In addition, HLA-E can present pathogen-
or tumor-derived peptide antigens to unconventional CD8+ T cells,
which recognize peptide-HLA-E complexes through specific TCRs,
regulating adaptive immunity.

For successful vaccine or immunotherapy
development, it will be critical
to demonstrate that identified
HLA-E peptide ligands are expressed
at the surface of tumor or infected
cells at densities and durations that
are sufficient to engage TCRs and
induce T cell activation and thus
immunological memory that is important
in vaccine responses. This
should include analysis of ligand
expression in the affected organ, for
example, the lungs for tuberculosis
(14). Furthermore, little is known about the
durability and memory capacity of HLA-E-restricted
T cells. In mouse tumor models,
peptide vaccination could induce memory
CD8+ T cells specific for Qa-1-restricted tumor
peptides (13). Data from the RhCMV
studies in NHPs suggest potent, long-term
induction of effector memory T cells, but
this could also reflect the continuous presence
of antigen expressed from replicating
RhCMV. In human CMV infection, expansion
of memory CD8+ T cells is observed (2),
probably reflecting the same phenomenon.

Suitable and affordable small-animal models,
such as HLA-E*0101 and HLA-E*0103
transgenic mice, would be of value in
exploring HLA-E biology. These would
allow comparative studies of vaccine formulations
using different delivery systems
(adjuvanted peptides, viral vectors, bacterial
carriers) to optimize vaccine efficacy. In
mouse tumor models and the RhCMV-SIV
NHP model, the association of MHC-E- and
Qa-1-restricted T cells with protective immunity
suggests an important role for MHC-E
and Qa-1, but specific depletion studies have
thus far not been performed. Mtb infection
of mice genetically lacking Qa-1 resulted in
more severe tuberculosis than in wild-type
animals, suggesting a protective role for
Qa-1-restricted CD8+ T cell responses during
Mtb infection (15). Consistently, Qa-1-restricted
CD8+ T cells were cytolytic and
could suppress other T cells, a phenotype
replicated by HLA-E-restricted human CD8+
T cells cultured in vitro (5). Perhaps HLA-E-restricted
CD8+ T cells contribute to protective
immunity to Mtb infection, and likely
other pathogens, by simultaneously killing
infected cells and inhibiting intracellular
infection (as shown for Mtb), while also suppressing
inflammation and thereby limiting
collateral tissue damage.

The biology of HLA-E is intriguing. For
example, why are both alleles maintained,
are they redundant, and what controls the
unexpected differences between them? Puzzling
in this context is the much higher
number of functional MHC-E variants in
NHPs: ~30 variants have been described (2).
Additionally, human Mtb-specific HLA-E-restricted
T cells were described to possess
a T helper 2 (TH2)-like phenotype, including
production of the cytokines interleukin-4
(IL-4), IL-5, and IL-13, and induced B cell
activation through IL-4 (5). However, the
functional role of this TH2-like phenotype
and the contribution of B cells in controlling
Mtb infection remain unclear.
At which sites do HLA-E-restricted
T cells act most prominently: mucosally
or systemically? It will also
be important to ascertain whether
these are tissue-resident memory
cells that can be targeted by mucosal
vaccination.

peutic approaches. Such strategies
include preventive or therapeutic 
subunit vaccines that can mobilize 
unconventional T cells, or T cells ex- 
pressing engineered TCRs that rec- 
ognize peptide-HLA-E complexes 
on infected or malignant cells. Al- 
ternatively, high-affinity soluble 
TCR molecules can be engineered 
that target malignant or infected 
cells with high precision. Vaccines 
for infectious and malignant dis- 
eases may be designed on the ba- 
sis of relatively small numbers of 
pathogen- or tumor-derived HLA-E- 
presented peptides, formulated 
in suitable adjuvants. In the case 
of established tumors, T cell-acti- 
vating vaccines may need strong 
potentiation—for example, by com- 
bination with immune checkpoint- 
blocking antibodies that prevent 
NKG2A binding to HLA-E (see the 
figure)—to relieve the immune checkpoint 
on NK and CD8+ T cell populations. These
HLA-E-centered strategies could help to
improve immune control of infectious diseases
and cancer (1, 3).


Photocatalytic deracemization fixes the mix


Photocatalysis converts racemic mixtures of N-aryl cyclic ureas into single enantiomers


By Alison E. Wendlandt 


Very often, only one enantiomer of
a chiral drug molecule is clinically
active; the other optical isomer can
produce unwanted or even catastrophic
side effects. Thus, the efficient
synthesis of single
enantiomers is important in 
small-molecule drug synthe- 
sis, and resolution methods 
are frequently used to enrich 
a molecule’s enantiomeric 
excess (ee) when racemic or 
scalemic mixtures (equal or 
unequal enantiomer mixtures, 
respectively) are obtained (1,
2). Classical and kinetic resolutions
of mixtures can only
reach a maximum 50% theo- 
retical yield, and although dy- 
namic kinetic resolution can 
achieve complete conversion 
to a single enantiomer, the re- 
sulting product is chemically 
modified (see the figure, top). 
Direct deracemization of chiral 
compounds reflects an ideal 
if rarely achieved approach to 
producing high-ee mixtures by 
resolution. On page 364 of this 
issue, Shin et al. (3) disclose a 
multicatalytic strategy for en- 
riching racemic mixtures di- 
rectly to a single enantiomer in 
a single step. 

Deracemization methods are 
constrained by the principle of 
detailed balance, which dictates 
that any reverse chemical step 
must be in all ways microscopi- 
cally identical to its correspond- 
ing forward step (4). Because 
(R) and (S) enantiomers are 
energetically identical, any el- 
ementary step or equilibrium 
process (catalyzed or not) that 
converts (R) into (S) must nec- 
essarily also convert (S) back 
into (R); the process will yield 
a racemate. Deracemization 
can be achieved, however, when
forward and reverse steps are designed to
proceed by distinct mechanisms through 
a common prochiral intermediate (see the 
figure, bottom). Highly selective precedents 
are limited, although several notable ap- 
proaches have relied on compartmentaliza- 
tion to separate incompatible oxidant and 


Deracemization versus resolution of enantiomers 
The fraction of a desired enantiomer (either R or S) from a mixture can be 
increased partially or indirectly through resolution methods, or interconverted 
completely through deracemization.
In the route reported by
Shin et al., an iridium 
photocatalyst initiates 
electron transfer from 
N-aryl cyclic ureas. 
Subsequent reactions 
that include a chiral 
base convert the S to 
the R enantiomer.
reductant combinations (5, 6) or have used
nonquenching redox pairs (7) to effect the
desired deracemization in a single pot.
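
The constraint can be made concrete with a two-state rate model. The sketch below is illustrative only: it assumes simple first-order interconversion between (R) and (S), and the rate constants are hypothetical numbers, not values from Shin et al. When the forward and reverse rate constants are equal, as detailed balance requires for a single shared pathway between energetically identical enantiomers, the steady state is a racemate. One enantiomer accumulates only when the two directions run through distinct mechanisms with unequal effective rates.

```python
def steady_state_fraction_r(k_r_to_s: float, k_s_to_r: float) -> float:
    """Steady-state fraction of (R) for first-order R <-> S interconversion.

    At steady state, k_r_to_s * [R] = k_s_to_r * [S], so
    [R] / ([R] + [S]) = k_s_to_r / (k_r_to_s + k_s_to_r).
    """
    return k_s_to_r / (k_r_to_s + k_s_to_r)


def ee_percent(fraction_r: float) -> float:
    """Enantiomeric excess of (R), in percent, from its mole fraction."""
    return 100.0 * (2.0 * fraction_r - 1.0)


# Single shared pathway: detailed balance forces equal rate constants.
thermal = steady_state_fraction_r(k_r_to_s=1.0, k_s_to_r=1.0)

# Distinct forward and reverse mechanisms (hypothetical effective rates):
# (S) is consumed 20 times faster than (R).
driven = steady_state_fraction_r(k_r_to_s=1.0, k_s_to_r=20.0)

print(f"shared pathway:  {ee_percent(thermal):.1f}% ee")
print(f"driven pathways: {ee_percent(driven):.1f}% ee")
```

The model ignores the intermediate and any side reactions; it is meant only to show why equal forward and reverse rates cannot enrich either enantiomer.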

As Onsager noted in his seminal mono- 
graph (8), photochemical processes are 
among the handful of exceptions to the 
principle of detailed balance, in that for- 


ward and reverse steps may 
proceed along distinct elec- 
tronic surfaces. A prominent 
example is the contra-thermo- 
dynamic photochemical isom- 
erization of alkenes, wherein 
the selective absorption of light 
can lead to enrichment of the 
less thermodynamically stable 
geometric isomer. This process
constitutes the molecular
basis for vision in vertebrates, 
and sensitized energy transfer 
has become a powerful syn- 
thetic strategy for obtaining 
(Z)-alkenes from (E)-alkenes
through a triplet excited state 
(9). Extraordinary recent work 
by Bach and co-workers ele- 
gantly demonstrated this strat- 
egy for the catalytic deracemi- 
zation of chiral allenes (10). 
The method reported by 
Shin et al. takes a complemen- 
tary photochemical approach 
to catalytic deracemization. 
Instead of energy transfer, the 
authors use an iridium photo- 
catalyst to promote reversible 
electron transfer (ET) from 
the substrate. Enantioselective 
deprotonation (proton trans- 
fer, PT) of the resulting radical 
cation occurs in the presence 
of a chiral phosphate base to 
form a prochiral neutral radi- 
cal intermediate with a net 
loss of a hydrogen atom from 
the substrate. This step func- 
tions as a photoinduced kinetic 
resolution, selectively ablating 
the stereochemical informa- 
tion of the faster-reacting (S)- 
substrate and enriching the 
once-racemic starting mixture 
in the (R)-substrate. The radi- 
cal intermediate then reacts 
with a chiral thiol in an en- 
antioselective hydrogen-atom
transfer (HAT) step, this time kinetically
favoring the formation of the (R)-substrate. 
Substoichiometric quantities of both the 
chiral base and chiral thiol can be used. 
The overall process consumes only pho- 
tons, unlike thermal redox deracemiza- 
tions, where stoichiometric oxidants and 
reductants are necessary to drive the reac- 
tion toward a single enantiomer. 

The net result is a highly selective syn- 
thetic deracemization of N-aryl cyclic urea 
substrates. Yet remarkably, neither of the 
two individual steps (ET-PT and HAT) pro- 
ceeds with high selectivity. Mathematically, 
the selectivity-determining step (SDS) is a 
composite of the two individual steps, and 
the observed enantiomeric ratio is the product
of the enantiomeric ratios of the two individual
steps. The multiplicative form of the
SDS results in extremely high overall enan- 
tiomeric enrichment through the synergis- 
tic action of these two chiral catalysts, even 
though each catalyst working individually 
is not particularly selective. 
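
To see why two modestly selective steps can compound, consider the arithmetic. The sketch below assumes the multiplicative relationship described above; the per-step ratios are hypothetical and are not the values measured by Shin et al.

```python
def ee_from_er(er: float) -> float:
    """Enantiomeric excess (as a fraction) from an enantiomeric ratio major:minor."""
    return (er - 1.0) / (er + 1.0)


def combined_er(er_step_1: float, er_step_2: float) -> float:
    """Overall enantiomeric ratio when two selective steps act multiplicatively."""
    return er_step_1 * er_step_2


# Hypothetical selectivities: each step alone gives a 9:1 ratio.
er_deprotonation = 9.0   # ET-PT step (assumed value)
er_hat = 9.0             # HAT step (assumed value)

er_overall = combined_er(er_deprotonation, er_hat)

print(f"single step: er = {er_deprotonation:.0f}:1, ee = {100 * ee_from_er(er_deprotonation):.1f}%")
print(f"combined:    er = {er_overall:.0f}:1, ee = {100 * ee_from_er(er_overall):.1f}%")
```

With these assumed numbers, a step that alone would deliver 80% ee is paired with a second step of the same quality to give a ratio of 81:1, or roughly 98% ee.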

Although the scope of the reaction reported
by Shin et al. is restricted to proof-of-concept
substrates, undoubtedly exceptional
cases such as this will become more
common. In particular, the structural simi- 
larity of the current substrates to impor- 
tant classes of chiral ligands (for example, 
oxazolidinones) and to intermediates in 
enzymatic deracemization and stereoinversion
pathways (for example, hydantoinase-decarbamoylase
systems) hints toward
the possibility of expanding this approach 
to the deracemization of amino acids and 
other important chiral molecules.
Photosystem II, poised for O2 formation


A consensus is emerging on the structure 
of the oxygen-evolving complex 


By R. David Britt and David A. Marchiori 


The vast majority of oxygen in Earth’s
atmosphere is generated by photo- 
synthetic light reactions. This process 
also plays an integral role in producing 
energy for Earth’s biosphere. Photosystem
II (PSII) uses visible light to oxidize
water and release O2. Its oxygen-evolving
complex (OEC) sequentially advances from
its most reduced state (S0), through four
photon-driven oxidations, to its most oxidized
state (S4), which produces O2. As described
on page 334 of this issue, Suga et al. (1) used 
a powerful x-ray free-electron laser (XFEL) 
to examine in structural detail the OEC just 
prior to O=O bond formation, thereby adding 
to a growing body of work in this area. Such 
mechanistic details of the OEC’s operations 
could help in the development of synthetic 
solar fuel reactions aimed at sustainable fuel 
production from water and sunlight. 

The chemistry of the PSII water oxidation 
process has been the focus of studies cou- 
pling molecular biology and biochemistry 
to spectroscopic methods (2, 3). For many 
years, progress in the field was made without 
a structural basis for interpretation, but since 
the early 2000s the field has greatly benefited 
from steadily improved x-ray crystal structures
of PSII nominally poised in the dark-stable
state (S1) (4-6).

The S1-state OEC is a Mn4CaO5 cluster, and
the structural changes that occur in this cluster
as it advances through the light-driven
S-state transitions are a major question in the
field. Coupled to these structural changes are
questions concerning the S-state transitions
in which the two substrate waters bind to
the cluster and in what chemical form (e.g.,
H2O, OH−, bridging, or terminal O2−). This is
additionally complicated by the presence of
an abundance of oxo bridges (a water-derived
oxygen atom that bridges two or more
Mn/Ca ions) and water molecules within and
proximal to the OEC. Which of these are the
actual substrates that are oxidized to form O2?

In one scenario, both substrate waters 
could bind to the OEC following O2 release
in the S4-to-S0 transition. In this case they
should be present as OEC ligands in the
dark-stable S1 structure, where two waters (W1 and
W2) are bound to a “dangler” Mn4 (7) and another
two waters (W3 and W4) are bound to
the Ca ion. This geometry suggests one
often-discussed model for O=O bond formation, in
which the dangler Mn4 (or another Mn of the
cluster’s core) is oxidized to a highly electrophilic
Mn(V)-oxo species by the S4 state. The
Mn(V)-oxo could then be attacked by a nu- 
cleophilic water or hydroxide on the adjacent 
Ca ion (2, 5). 

However, it is also plausible that one or 
both waters bind to the OEC later in the 
S-state cycle, arriving from some nearby 
solvent water site(s). Siegbahn’s proposed 
model, based on density functional theory 
(DFT) energetics calculations, suggests that a 
water binds to an open coordination site at 
Mn1 in the S2-to-S3 transition (8), which upon
further oxidation to S4 forms a Mn(IV)-O•
species that creates the O=O bond through
a radical coupling to the bridging O5 (see the
figure). The conversion of a five-coordinate
Mn(IV) to a six-coordinate Mn(IV) at the S3
state, with a new water binding to Mn1 at S3,
has been supported by recent high-field elec- 
tron paramagnetic resonance (EPR) spectros- 
copy studies (9, 10). 

XFELs produce pulses of a few femtoseconds’
duration with high enough photon energy
and flux for use in x-ray diffraction
of microscale protein crystals. This tool has
been used to study the OEC’s water oxida- 
tion mechanism. XFEL pulses are sufficiently 
short that the desired x-ray diffraction occurs 
before the microscale crystals are destroyed. 
The XFEL pulses can be synchronized with 
visible-light laser pulses to drive the S-state 
advancement. PSII is a particularly good tar- 
get for this approach because short visible- 
light pulses drive the water-splitting reaction. 
This has allowed investigators to probe the
structures of the transient S2 and S3 states,
with several recent studies focused on the latter
state, which is only one flash/electron transfer
from the formation of O2 (11-13).
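
The flash-counting logic behind these experiments can be sketched as a small state machine. This is a simplified illustration, not the authors' analysis: it assumes that every flash advances every reaction center by exactly one S state (real samples show misses, which is why flashed crystals are described as "enriched" in a state), and it treats S4 as a transient that immediately releases O2 and resets to S0.

```python
S_STATES = ["S0", "S1", "S2", "S3", "S4"]


def advance(start_state: str, flashes: int) -> tuple[str, int]:
    """Advance an idealized OEC by a number of flashes.

    Returns the final S state and the number of O2 molecules released.
    Assumes perfect advancement on every flash (no misses).
    """
    index = S_STATES.index(start_state)
    o2_released = 0
    for _ in range(flashes):
        index += 1
        if S_STATES[index] == "S4":
            # S4 is treated as transient: O2 is released and the cycle restarts.
            o2_released += 1
            index = 0
    return S_STATES[index], o2_released


# Dark-adapted samples start in the dark-stable S1 state.
for n_flashes in range(5):
    state, o2 = advance("S1", n_flashes)
    print(f"{n_flashes} flash(es): state {state}, O2 released: {o2}")
```

In this idealized picture, two flashes on a dark-adapted sample populate S3, and the third flash carries the cluster through S4 to O2 release.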

There are many important aspects to these 
advanced S-state structures, but the question 
of substrate binding at the S3 state is particularly
crucial, as this is key to proposed
mechanisms for O=O bond formation. In
2016, a study reported a room-temperature
structure of PSII following two light flashes,
hence enriched in the S3 state of the OEC
(12). It reported no new S3-state electron density
around Mn1 of the kind one would expect for a
newly bound water. This result directly conflicts
with the proposed Siegbahn model and
EPR studies. Shortly following this study,
another group reported an S3 structure with
new electron density near Mn1 and O5 that
was assigned to a newly bound oxygen (O6)
(12). This was considered support for the
S3-insertion substrate water model. However,
the study showed close proximity (0.15 nm)
between O5 and O6, consistent with O-O
bond formation producing a peroxide at the
S3 state, which would in turn be associated
with Mn reduction in the S2-to-S3 transition.
This is in conflict with EPR and x-ray spectroscopic
data (3, 9, 14, 15). A more recent
study reports a new oxygen “Ox” bound both
to Mn1 and the Ca ion of the OEC (13), and
with a 0.21 nm distance to O5, much longer
than the reported 0.15 nm (12) and too long
to represent the O-O bond of a peroxide.
Suga et al. have now cryotrapped serially 
flashed microcrystals of PSII with the goal 
of achieving more accurate interatomic dis- 
tances, especially between O5 and O6. In 
the S2 state, they observed an open cubane
structure with a five-coordinate Mn1. At the
S3 state, a flip in the side chain of the monodentate
carboxylate ligand Glu189 (12) provides
room for O6 to insert and bind to Mn1
and Ca. To address the crucial O5-O6 distance,
Suga et al. calculated difference maps
as a function of modeled O5-O6 distance and
found the smallest residual density at a distance
of about 0.19 nm. Their full residual
density analysis suggests an oxyl/oxo pairing
for O5 and O6, which they consider evidence
for an oxyl/oxo coupling mechanism in the
O=O bond formation at the final S4 state.
Glu189 appears to gate water insertion into the
OEC from the “O1” water channel of the PSII
reaction center.

Although there are still open questions as 
to the detailed assignments of substrate wa- 
ters and the exact modes of water oxidation 
and O=O bond formation, the results from 
different studies are starting to converge, and 
the structural results appear to be closely in 
line with mechanistic proposals supported by 
computational chemistry and spectroscopy. 
The field anxiously awaits possible time- 
resolved structures detailing the O=O bond 
formation through the S3-to-S0 transition.
The S states in the oxygen-evolution reaction

The oxygen-evolving complex is photo-oxidized through a series of S states to produce molecular oxygen
from water. In the final steps before O=O bond formation, a new oxygen, O6, binds to the vacant site at Mn1. After
a final photo-oxidation event, O5 and O6 appear poised to form an O=O bond, releasing molecular oxygen,
reducing the cluster, and beginning the catalytic cycle anew. Glutamic acid at position 189 is noted as E189.
Spikes in the sleeping brain


Memory is replayed and consolidated under low background noise during deep sleep


By Yuji Ikegaya and Nobuyoshi Matsumoto


Memories of recent experiences are
transferred and fixed in long-term
storage in the neocortex during
sleep. A key aspect of this process
is the occurrence of ripples, which
are high-frequency oscillations in
neuronal activity in the hippocampus (1,
2). Ensembles of hippocampal neurons that 
have been activated during an experience 
emit synchronized neuronal activity (spikes) 
during ripples in subsequent sleep (3). These 
“replayed” memories propagate to the neo- 
cortex during slow-wave states (4), which are 
characterized by alternating “bright” states 
with active neurons and “dark” states, called 
delta waves, without active neurons (5). Re- 
searchers reasoned that hippocampal-corti- 
cal dialogue occurs during the bright states 
and that delta waves represent intermittent 
neocortical “sleep” to recover from synaptic 
fatigue (6) or increased potassium conduc- 
tance (7) and have no active function. On 
page 377 of this issue, Todorova and Zugaro 
(8) challenge this view, demonstrating that 
a small number of neurons are reactivated 
during delta waves. 

Todorova and Zugaro recorded activity, 
both spikes from multiple neurons and lo- 
cal field potentials, simultaneously in the 
rat dorsal hippocampus and the medial 
prefrontal cortex, which is believed to store 
long-term memories. They reconfirmed that 
the prefrontal cortex becomes inactive dur- 
ing each delta-wave epoch but also discov- 
ered that a few neurons remain active and 
occasionally emit synchronous spikes dur- 
ing delta waves (which they called “delta 
spikes”) (see the figure). The neurons re- 
corded in this study constitute only a small 
portion of the total cells in the prefrontal 
cortex. Thus, it is important to ask how fre- 
quently delta spikes occur. The local field
potential waveforms, which reflect the synchronous
activity of a group of neurons,
do not differ depending on whether delta
waves are accompanied by delta spikes in
the recorded neurons. Moreover, the probability
of detecting delta spikes is proportional
to the number of simultaneously
recorded neurons. Extrapolating these observations
suggests that delta spikes accompany
virtually all delta waves.

Hippocampal-cortical dialogues during sleep
The hippocampus communicates with the neocortex and activates experience-associated neocortical neuron ensembles during delta waves, low-cerebral-activity states that emerge intermittently during slow-wave sleep. This selective activity focuses memory consolidation.
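
One way to picture this extrapolation is with a toy probability model. The sketch below assumes, purely for illustration, that each neuron fires a delta spike independently with a small fixed probability per delta wave; the independence assumption and the numbers are hypothetical and do not come from Todorova and Zugaro. Under that assumption, the chance of detecting at least one delta spike grows roughly in proportion to the number of recorded neurons while that number is small, and it approaches certainty for a population the size of a cortical region.

```python
def p_detect(n_recorded: int, p_per_neuron: float) -> float:
    """Probability that at least one of n recorded neurons spikes in a delta wave.

    Assumes each neuron spikes independently with probability p_per_neuron
    (a simplifying, hypothetical assumption).
    """
    return 1.0 - (1.0 - p_per_neuron) ** n_recorded


P_SPIKE = 0.001  # hypothetical per-neuron probability per delta wave

for n in (10, 50, 100, 10_000):
    print(f"{n:>6} neurons recorded: P(detect) = {p_detect(n, P_SPIKE):.4f}")
```

For small recordings the result is close to n times the per-neuron probability, which matches the proportionality described in the text; the saturation at large n is what the extrapolation relies on.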

Delta spikes are not shots in the dark or 
remnants of ripples that accidentally fail to 
disappear during delta waves. Rather, delta 
spikes occur in spatiotemporally designed, 
specifically recursive configurations and 
seem to be actively generated in the brain. 
Todorova and Zugaro an- 
notated delta spikes as be- 
havioral correlates. As rats 
were trained in a spatial 
memory task, behaviorally 
relevant neurons became 
preferentially activated in 
delta waves during the fol- 
lowing sleep period. The 
neuronal replay in the pre- 
frontal cortex appears to be 
instructed by hippocampal ripples, because 
the set of prefrontal neurons that emit a 
delta spike in a given delta wave can be 
statistically predicted from the set of hip- 
pocampal neurons activated in the ripple 
immediately preceding the delta wave. The 
spike correlations of cell pairs between the 
hippocampus and the medial prefrontal 
cortex remain high for hundreds of milli- 
seconds, indicating that hippocampal-cor- 
tical dialogue occurs within a short time 
frame, during which the neocortex serves 
as a temporary reservoir of hippocampal 
information. As a result, memory-relevant 
delta spikes are fired mainly in the early 
phase of each delta-wave event. 

Delta spikes consist mainly of behavior- 
ally relevant neurons, whereas behaviorally 
irrelevant neurons are more likely silenced 
during delta waves. Reducing irrelevant
information improves the signal-to-noise 
ratio of neuronal information and thereby 
helps isolate cortical computations to avoid 
deleterious cross-talk. That is, the neocortex 
quiets down during delta waves and focuses 
attention on the content to be memorized, 
consistent with the authors’ previous pre- 
diction that delta waves “isolate target 
synapses from competing inputs, allowing 
selective reorganization of the network” (9). 
This leads to the possibility that facilitating 
delta spikes in ripple-associated neurons 
could enhance memory consolidation. Con- 
sistent with this idea, task performance in 
mice is enhanced when artificial delta waves 
containing ripple-coupled delta spikes are 
repeatedly induced using 
brief single-pulse electrical 
stimulation of deep cortical 
layers during sleep after an 
object-discrimination task. 
This suggests that delta 
spikes mediate memory 
consolidation. 

How do delta spikes 
contribute to memory con- 
solidation? It is unknown 
what delta waves give rise to in prefrontal 
neuronal circuits. Delta spikes are tran- 
sient and thus are unlikely to work as per- 
sistent memories by themselves. So, what 
form of long-lasting plasticity is induced 
by delta spikes? Moreover, activity patterns 
in individual task-associated neocortical 
neurons change over a period of days after 
learning (10). Do delta spikes also have an 
instructive role in the dynamic reorganiza- 
tion of neuronal identities? Another line 
of evidence suggests that delta waves are 
involved in forgetting rather than memory 
consolidation (11). Thus, drifting delta
spikes could contribute to deactivation of 
memory traces. 

Another question is how the dorsal hip- 
pocampus and the medial prefrontal cor- 
tex communicate during slow-wave sleep. 
Direct synaptic connections are unlikely
to exist between these brain regions. On
the basis of anatomical evidence of axonal 
projections, candidates for the relay sta- 
tion include the ventral hippocampus (12), 
a neocortical area via the entorhinal cortex 
(13), and the retrosplenial cortex via the 
subiculum (14). Recording or ablation of
neuronal activity in these brain regions is 
necessary to answer this question. 

Delta waves are a hallmark of the entire 
neocortex during sleep, and delta spikes 
may occur in cortical regions other than the 
medial prefrontal cortex. Do delta spikes 
have a role in processes other than memory 
consolidation? Todorova and Zugaro have 
successfully linked their unexpected dis- 
covery to the current knowledge of memory 
and sleep, but future investigations are still 
required to uncover the precise mecha- 
nisms and functions of delta spikes.
Rapid reorganization of global biodiversity


Marine systems outpace terrestrial habitats in biodiversity erosion 


By Britas Klemens Eriksson and
Helmut Hillebrand


Twenty-five years of research on the
relationship between biodiversity
and ecosystem function have revealed
that biodiversity drives fundamental
ecosystem processes and
regulates their temporal and spatial
stability (1, 2). Despite clear signs that human
efforts have failed to halt global biodiversity
loss (3, 4), it has been difficult to
identify corresponding signs of global-loss 
trends in the context of local ecosystems 
(5-9). On page 339 of this issue, Blowes et 
al. (10) report their analysis of local bio- 
diversity changes using a large dataset of 
>50,000 biodiversity time series from 239 
studies. Each time series represents a rec- 
ord of species composition at a selected 
site over time, with sites representing all 
major ecosystem types and climatic zones. 
The authors demonstrate that the identi- 
ties of species and their abundances are 
being rapidly reorganized. 

Although the proportion of reported 
global extinctions remains small compared 
with previous mass extinctions, the cur- 
rent decline in the abundance of myriad 


species and the elevated danger of their ex- 
tinction have raised scientific concern over 
the ill effects of biodiversity loss on human 
health and well-being (3). The controversy 
over global versus local changes in biodi- 
versity arose from comprehensive analyses 
of trends in species numbers worldwide; 
the data revealed that local communities 
show strong species turnover but no sys- 
tematic decrease in the total number of 
species—a metric referred to as species 
richness. In fact, most habitats undergo 
replacement of some or many species by 
other distinct species with no general shift 
in the total number of species (5, 6). Even 
coastal time series show a systematic in- 
crease in average species richness over 
time, and declines are observed only in lo- 
cations with strong human pressures (7). 
Together, these studies (5-7) initiated 
a debate on why the average number of 
species in local communities has not (yet) 
been negatively affected by the global ero- 
sion of biodiversity (5-9, 11). A key lesson 
learned from this debate is that biodiver- 
sity change represents much more than a 
change in the average number of species. 
Species richness in a local habitat re- 
flects the net difference between the num- 


ber of species immigrating into a habitat 
and the number becoming extinct over 
time, and it is not a sensitive measure of 
biodiversity change (6, 7), which also in- 
cludes shifts in species composition and 
abundances of individual species types 
(see the figure). A recent supporting study 
found that a complete exchange of species 
composition over time without any change 
in the number of species (that is, species 
richness) commonly occurs in nature (11).
The same study challenged the expecta- 
tion that changes in ecosystem conditions 
(for better or worse) will be reflected by 
simple trends in the number of species; in 
fact, this hypothesis invokes an equilibrium 
view of biodiversity that might rarely be 
achieved. A habitat undergoing environmen- 
tal change can become suitable for new spe- 
cies, but their impact on resident species will 
take time to manifest (see the figure). Thus, 
native species are likely to survive for some 
time before they are excluded (for example, 
through competition). Ecological systems ex- 
posed to continuous environmental change 
can, therefore, be expected to show a system- 
atic increase in the number of species if the 
rate of immigration is faster than the rate of 
extinction (so-called extinction debt) (12).
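
The difference between species richness and compositional change is easy to show with a small calculation. The sketch below uses made-up species labels and Jaccard dissimilarity, one common turnover measure chosen here for illustration; it is not presented as the exact metric used in the studies cited.

```python
def richness(community: set[str]) -> int:
    """Species richness: the number of distinct species present."""
    return len(community)


def jaccard_dissimilarity(before: set[str], after: set[str]) -> float:
    """Turnover between two surveys: 0 means identical, 1 means no shared species."""
    union = before | after
    if not union:
        return 0.0
    return 1.0 - len(before & after) / len(union)


# Hypothetical surveys of the same site at two times.
survey_t0 = {"sp_A", "sp_B", "sp_C", "sp_D"}
survey_t1_partial = {"sp_A", "sp_B", "sp_E", "sp_F"}   # half the species replaced
survey_t1_complete = {"sp_E", "sp_F", "sp_G", "sp_H"}  # every species replaced

for label, later in (("partial", survey_t1_partial), ("complete", survey_t1_complete)):
    change_in_richness = richness(later) - richness(survey_t0)
    turnover = jaccard_dissimilarity(survey_t0, later)
    print(f"{label}: change in richness = {change_in_richness}, turnover = {turnover:.2f}")
```

In both hypothetical cases richness is unchanged at four species, yet the composition has shifted substantially, which is why richness alone is an insensitive measure of biodiversity change.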
The Pacific oyster is native to Pacific Asia but has
spread through accidental introductions across the 
world. Today, it is found on shores in Australia, Europe, 
New Zealand, and North America. 


The need to monitor biodiversity changes 
by addressing not only species identities 
but also their abundances is intensified by 
a traditional problem in the ecological and 
environmental sciences: Most data from rel- 
evant monitoring programs are locked up 
in governmental programs and institutions 
and are not openly accessible by scientists 
and the public. To address this serious de- 
ficiency, Dornelas and colleagues developed 
a community-led open-source database of 
biodiversity time series called BioTIME, 
with the aim of promoting robust analyses 
of changes in global biodiversity (13). 

Using BioTIME, Blowes et al. pre- 
sent a dynamic perspective of biodiver- 


sity changes that will shift the discussion 
on this topic. They show that the rate of 
change is highly context-dependent, but 
they also demonstrate increasing instability 
of biodiversity, which raises concerns in an 
era of global change. In particular habitats, 
the number of species either increased or 
decreased over time, but when assessing 
all habitats simultaneously, the authors observed
no net change in the average number
of species across the globe. The most notable
finding by Blowes et al. was that local
species assemblages were being rapidly
reorganized over time, with nearly a third 
of all species being replaced by new species 
every decade; these changes in biodiversity 
were stronger in marine systems than in 
terrestrial ones. Their study thus highlights 
that the global biodiversity crisis, at least 
for now, is not primarily about decline but, 
rather, about large-scale reorganization. 


Interplay of local and regional diversity alters environment 
Three biodiversity scenarios occur at different times in history: equilibrium scenario (past; no global 
change in biodiversity), current scenario (moderate global change), and future scenario (both regional 
and local richness decrease over time). Each symbol and color depicts a distinct species. 


Equilibrium scenario (regional biodiversity, past)
A balance among species immigration, emigration, and extinction events regulates regional biodiversity. Members of the regional species pool pass through an environmental filter and colonize local species assemblages.

Current scenario (regional biodiversity, current)
Environmental factors cause the growth of some species to be favored over that of others. Because the environmental impact takes time to manifest, threatened species likely will survive for a while before becoming extinct. This leads to an increase in local species richness despite regional erosion of biodiversity.

Future scenario (regional biodiversity, future)
With prolonged human impact, environmental change intensifies, and species resilience erodes. Populations of new, better-adapted species grow, and regional extinctions intensify. If a local region is isolated or the global pool lacks a richness of adapted species, immigration cannot compensate for biodiversity loss.

Figure labels: immigration, colonization, extinction, emigration, local biodiversity, biological homogenization.

SCIENCE sciencemag.org 


Published by AAAS 


These findings raise questions about 
species homogenization and adaptability 
(14), as well as an expectation of precari- 
ous ecosystem services in the future, when 
a large reorganization of local species as- 
semblages affects the temporal stability 
of ecosystem functions (see the figure). 
Human-mediated introduction of new spe- 
cies might increase local species richness 
by elevating immigration rates, but at the 
same time it promotes biological homoge- 
nization—the process by which the compo- 
sition of species in an ecosystem becomes 
more uniform. Homogeneity in local spe- 
cies leads to erosion of biodiversity over 
larger regions (15) and can limit the ability 
of species communities to adapt to future 
environmental changes if immigration 
rates decline (14). Thus, local increases in
species richness might even correlate with 
regional declines in ecosystem resilience 
over time. Furthermore, by showing high 
temporal turnover of species composition 
for all possible trends in species richness, 
Blowes et al. confirm that such trends are 
insufficient to describe the consequences of 
global change for ecological communities. 

The study of Blowes et al. also has two 
implications for ecology as a scientific disci- 
pline: It brings together authors who origi- 
nally had highly divergent views on local 
biodiversity change, and it demonstrates 
the importance of open access to biodiver- 
sity data of sufficient quality. Only the high 
quality of BioTIME data enabled Blowes 
et al. to clearly identify geographical regions 
that experienced more or less change in bio- 
diversity than expected, which will affect the 
setting of conservation priorities. With open- 
source access to international databases, the 
scientific community can more precisely 
monitor the biodiversity crisis and develop 
local solutions.
SYNTHETIC BIOLOGY


Technological challenges and 
milestones for writing genomes 


Synthetic genomics requires improved technologies 


By Nili Ostrov, Jacob Beal, Tom Ellis, D. Benjamin Gordon, Bogumil J. Karas, Henry H. Lee,

Engineering biology with recombinant
DNA, broadly called synthetic biology,
has progressed tremendously in
the last decade, owing to continued
industrialization of DNA synthesis,
discovery and development of molecular
tools and organisms, and increasingly
sophisticated modeling and analytic
tools. However, we have yet to understand 
the full potential of engineering biology 
because of our inability to write and test 
whole genomes, which we call synthetic 
genomics. Substantial improvements are 
needed to reduce the cost and increase the 
speed and reliability of genetic tools. Here, 
we identify emerging technologies and im- 
provements to existing methods that will be 
needed in four major areas to advance syn- 
thetic genomics within the next 10 years: 
genome design, DNA synthesis, genome 
editing, and chromosome construction (see 
table). Similar to other large-scale projects 
for responsible advancement of innovative 
technologies, such as the Human Genome 
Project, an international, cross-disciplinary 
effort consisting of public and private enti- 
ties will likely yield maximal return on in- 
vestment and open new avenues of research 
and biotechnology. 

The ability to design and write genomes 
of living cells provides distinctive oppor- 
tunities to tackle problems that are in- 
tractable with current technologies. These 
transformative technologies could have 
widespread scientific, social, and eco- 
nomic impacts; thus, their development 
and adoption require proactive identifica- 
tion of potential pitfalls through ongoing 
public discussion. Accordingly, the outlook 
presented here is part of a broader effort 
by the international Genome Project-write 
(GP-write) consortium (1) to encourage
inclusive conversation among scientists, 
lawyers, ethicists, educators, environmentalists,
other experts, and stakeholders,
as well as the general public, and ensure 
responsible, safe, and coordinated imple- 
mentation of these new technologies. 

Synthetic genomics is a relatively new 
field, and the majority of writing technolo- 
gies (with the exception of commercial DNA 
synthesis) are still developed by academic 
research laboratories. Thus, unlike the rea- 
sonably predictable progress of engineer- 
ing in more established industries, such 
as semiconductors, prediction of timelines 
and costs in these nascent fields remains 
highly speculative (see table). 


GENOME DESIGN 

Genome design aims to encode higher-level 
design criteria into DNA sequences at the 
chromosome scale. This will require com- 
puter-aided design (CAD) technologies to 
(i) reliably produce desired phenotypes; (ii) 
maximize the impact of the design, both in 
terms of experimental feedback and technical
feasibility; and (iii) facilitate collaboration
by employing standards for handling
and exchanging design information. Current 
synthetic genomic CAD software, such as 
used for Sc2.0 (2), uses automation and col- 
laboration tools for scaling DNA design from 
plasmids to entire chromosomes. Currently, 
estimation of functional effects of edits (e.g., 
gene deletions) is left to human experts. 
Looking forward, these tools will need to ac- 
curately predict the viability and phenotype 
of a cell from its genome design. 

Although simple models are sufficient to 
handle silent edits (e.g., synonymous edit- 
ing of coding sequences for watermark- 
ing), models of increasing complexity and 
precision will be necessary to predict how 
changes to genome sequence affect gene reg- 
ulation and protein function. Although com- 
prehensive mechanistic models of higher 
eukaryotes are likely decades away, machine 
learning approaches could improve phenotype
prediction by leveraging the wealth of
high-dimensional, high-throughput systems 
biology data in public databases and gener- 
ated from genome writing projects, similar 
to how such techniques have been used to 
predict protein structures. 
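
A minimal sketch of what a "silent edit" looks like in practice: the Python snippet below swaps selected codons for synonymous ones and confirms that the encoded protein is unchanged. The function names (`translate`, `watermark`), the rule for choosing the replacement codon (the first synonymous codon in table order), and the toy sequence are illustrative assumptions; they are not taken from Sc2.0 or any published watermarking scheme. Only the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence (uppercase DNA) to protein."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def synonymous_alternatives(codon):
    """Return the other codons that encode the same amino acid."""
    aa = CODON_TABLE[codon]
    return [c for c, a in CODON_TABLE.items() if a == aa and c != codon]


def watermark(cds, codon_positions):
    """Replace the codons at the given indices with a synonymous codon.

    Hypothetical rule: take the first alternative in table order.
    Codons without a synonym (ATG, TGG) are left untouched.
    """
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    for pos in codon_positions:
        alternatives = synonymous_alternatives(codons[pos])
        if alternatives:
            codons[pos] = alternatives[0]
    return "".join(codons)


original = "ATGGCTAAATGGTAA"          # toy sequence, not a real gene
marked = watermark(original, [1, 2, 3])
assert translate(marked) == translate(original)
```

A check of this kind covers only the protein sequence. Predicting effects on regulation or expression level is the harder modeling problem the text describes.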

Leveraging these models, there is a need 
for experimental design tools to minimize 
the number of expensive iterations required 
to obtain successful genome designs. For in- 
stance, redesign of the relatively small Myco- 
plasma genome required four iterations, as 
well as genomic screens to identify essential 
genes (3). Larger projects will require many 
more intermediate stages of construction 
and supporting experiments. To provide the 
most valuable feedback and leverage the re- 
sulting data in subsequent designs, new al- 
gorithms are required to automate design of 
experiments and selection of appropriate en- 
gineering technologies for their implementa- 
tion (e.g., “write versus edit”). 

A related, unaddressed need is to ensure 
compatibility of designed DNA with con- 
straints from downstream synthesis, as- 
sembly, delivery, and analytical stages (e.g., 
software-guided parsing of chromosome se- 
quence into synthesis-compatible fragments, 
or introduction of designated sequences to 
facilitate assembly and delivery). Tools will 
need to be adaptable to anticipate advances 
in writing technologies. 
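
The parsing step mentioned above can be sketched as follows. This is an illustrative example only: `split_into_fragments`, the 5000-base fragment limit, and the 40-base overlap are assumed placeholders, not parameters of any real design tool. Production software would also screen each fragment against vendor constraints such as GC content and repeats.

```python
def split_into_fragments(seq, max_len=5000, overlap=40):
    """Split a sequence into fragments of at most max_len bases.

    Consecutive fragments share `overlap` bases so that they can be
    joined by overlap-based assembly. Returns (start, end, fragment)
    tuples using 0-based, end-exclusive coordinates.
    """
    if not 0 <= overlap < max_len:
        raise ValueError("overlap must be non-negative and smaller than max_len")
    fragments = []
    start = 0
    step = max_len - overlap
    while True:
        end = min(start + max_len, len(seq))
        fragments.append((start, end, seq[start:end]))
        if end == len(seq):
            break
        start += step
    return fragments


def reassemble(fragments, overlap=40):
    """Rebuild the sequence by dropping the shared prefix of each later fragment."""
    pieces = [fragments[0][2]] + [frag[overlap:] for _, _, frag in fragments[1:]]
    return "".join(pieces)


chromosome = "ACGT" * 3000            # 12,000-base toy sequence
parts = split_into_fragments(chromosome)
assert reassemble(parts) == chromosome
```

Keeping the coordinates alongside each fragment lets later assembly and sequencing results be mapped back to the original design.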

All of these efforts rely on and generate 
large datasets, which require stewardship 
to streamline model definition and facili- 
tate sharing of results. Two major barriers 
for integrating biological information are 
data incompatibility and a lack of suffi- 
ciently descriptive metadata. We encourage 
researchers to use widely adopted data-ex- 
change formats, e.g., GenBank and general 
feature format, and to continue to establish 
and adhere to standards for experimental 
metadata, e.g., the Synthetic Biology Open 
Language (SBOL). 

Funding agencies and industrial stake- 
holders should prioritize support for long- 
term development of software and standard 
data formats, including collaboration, visu- 
alization, and quality control capabilities, 
similar to initiatives already in place for 
genomics, such as the Wellcome Trust Open 
Research Fund and the Chan Zuckerberg Bio- 
Hub. Synthetic genomics software will not 
only enhance our ability to plan and execute 
large-scale genetic projects, but will likely 
also lead to fundamental methodological ad- 
vances to move from correlation to causation 
of genotype-phenotype relationships. 


DNA SYNTHESIS 

Genome writing projects depend on large 
numbers of long [>5000 base pairs (bp)] and 
precise synthetic DNA constructs (4, 5). However,
chemical synthesis of DNA remains limited
to production of short oligonucleotides
(oligos), commonly 200 bp long. Although
oligos have driven major advances in recombinant
DNA technologies, larger DNA constructs
require assembly of multiple oligos, a
process that is laborious and lossy. Routine
production of long, precise fragments of synthetic
DNA would be desirable.

Key challenges and milestones for synthetic genomes

KEY TECHNOLOGY DEVELOPMENT TARGET | EXAMPLE OF DESIRED MILESTONES | ESTIMATED TIME (YEARS)

Genome design
Develop tools for genome-scale design, visualization, and quality control. | Design a virus-proof mammalian chromosome. | 3
Integrate structural information (2D and 3D) into genome design software. | Predict the conformation of a synthetic yeast chromosome. | 5
Develop sequence-to-phenotype whole-cell modeling. | Optimize metabolic profile, accurate to within twofold, for 100 key gene products of a synthetic virus-proof chromosome. | 10

DNA synthesis
Increase coupling efficiency for oligonucleotide synthesis. | Synthesize high-fidelity oligonucleotides longer than 500 nucleotides. | 3
Increase efficiency of in vitro DNA assembly for fragments >20 kb. | Assemble 20 kb with >50% yield. | 4
Develop methods for synthesis of difficult sequences, including homopolymers, high-GC content, and secondary structure. | Synthesize a centromere. | 5
Develop enzymatic methods for direct synthesis of multikilobase DNA fragments. | Synthesize a 10-kb fragment (without assembly). | [unclear]
Decrease cost of DNA synthesis by 1000-fold. | Synthesize and assemble DNA for one haploid human genome (i.e., 3.2 × 10^9 bases) for $1000. | 10

Genome editing
Expand multiplexity and precision of DNA editing. | Simultaneously edit 1000 different targets in a single bacterial, mammalian, or plant cell with 1 off-target hit per 10,000 genomes. | 2
Increase efficiency of homologous-directed repair (HDR)-mediated editing in mammalian and plant cells. | Perform HDR-mediated editing in nondividing mammalian cells at >90% efficiency. | 3
Develop editing enzymes for precise substitution of any nucleotide at any desired genomic locus, with increased efficiency. | Perform allele editing of human cells at sites lacking PAM sequence, with >95% efficiency. | 5

Chromosome construction
Develop methods for temporal and spatial control of single chromosomes, such as chromatin state. | Engineer segregating, stable human artificial chromosomes (HAC). | 2
Develop specialized host cells with high efficiency for DNA assembly, particularly for difficult-to-assemble sequences. | Establish in vivo chromosome assembly methods in the host Streptomyces coelicolor (72% GC content). | 5
Develop efficient, inexpensive methods for routine and automated delivery of entire chromosomes into cells. | Demonstrate routine, device-based chromosome delivery in mammalian cells by cell fusion. | 3
Develop methods for assembly and testing of Mb-size chromosomes. | Assemble a synthetic recoded human chromosome 21 from DNA fragments. | 10

Although DNA has become more available
through commercial vendors in recent years
as a result of industrialization by parallelization
and miniaturization, there has been little
improvement to the underlying phosphoramidite
chemistry, which limits DNA length,
production speed, and cost. Accordingly,
construction of whole chromosomes remains
expensive and time-consuming. For example,
array synthesis of oligos costs approximately
$0.0005 per nucleotide, yielding an estimate 
of $1.5 million for synthesizing 3 gigabases of 
DNA—the size of a human genome. Radical 
new approaches to DNA assembly, purifica- 
tion, and synthesis processes are thus re- 
quired to achieve substantial advancement 
on cost and ease. 
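
The arithmetic behind this estimate, and what the table's cost milestone implies, can be checked with a few lines of Python. The variable names are illustrative. The figures are those quoted in the text and table, and the calculation covers raw oligo synthesis only, ignoring assembly, error correction, and cloning.

```python
COST_PER_NT_USD = 0.0005          # array-synthesized oligos, per nucleotide
GENOME_SIZE_NT = 3_000_000_000    # about 3 gigabases, as quoted in the text

# Cost of synthesizing one genome's worth of oligos at today's price.
current_cost = GENOME_SIZE_NT * COST_PER_NT_USD           # about $1.5 million

# Effect of the 1000-fold cost reduction targeted in the table.
cost_after_1000_fold = current_cost / 1000                # about $1500

# Per-base price implied by the milestone of $1000 for 3.2 x 10^9 bases.
target_cost_per_nt = 1000 / 3.2e9                         # about $3.1e-7
fold_needed = COST_PER_NT_USD / target_cost_per_nt        # about 1600-fold

print(f"current: ${current_cost:,.0f}")
print(f"after 1000-fold reduction: ${cost_after_1000_fold:,.0f}")
print(f"fold reduction needed for the $1000 milestone: {fold_needed:,.0f}")
```

The $1000 milestone is therefore somewhat more demanding than a flat 1000-fold reduction, because it also has to absorb the cost of assembly.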

Innovations to minimize or eliminate the 
need for assembly, error correction, and 
cloning of DNA fragments assembled from 
oligos could boost productivity of current 
DNA synthesis infrastructure. To increase 
the yield of perfect sequences, currently 
ranging from 5 to 60% (6), hybridization 
and error correction can benefit from the 
engineering of high-fidelity polymerases 
and ligases. These advancements, largely 
driven by industry, will decrease operating
costs and production time. Cloning
efficiency can also be enhanced by harness- 
ing hosts with rapid division and/or high 
recombination rates, or obviated by using 
cell-free cloning and artificial cells. These 
technologies require fundamental research 
before they reach commercial readiness. 

New technologies capable of synthesiz- 
ing high-quality long DNA fragments would 
fundamentally alter chromosome-scale en- 
gineering. Recently, rapid production of 
short sequence-defined single-stranded DNA 
(ssDNA) has been reported by using the 
template-independent DNA polymerase TdT 
(terminal deoxynucleotidyl transferase) (7, 8).
TdT offers the potential for directly synthesiz- 
ing multikilobase sequences with increased 
polymerization rate and higher coupling 
efficiencies. To compete with existing phos- 
phoramidite chemistry, enzymatic synthesis 
approaches should be further developed to 
address complex sequences and achieve pre- 
cise, high-quality DNA in an automated and 
cost-effective fashion. These efforts, ripe for 
startup company attention, will benefit from 
continued investment in fundamental re- 
search to elucidate molecular mechanisms of 
enzymatic terminal transferase reactions. 
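
Why coupling efficiency matters so much for length can be shown with a simple, commonly used approximation: if each addition step succeeds independently with the same probability, the fraction of full-length product falls geometrically with length. The sketch below assumes that model. The function names and the efficiencies used (99.5% and 99.9%) are illustrative values, not measurements reported in this article.

```python
import math


def full_length_fraction(length_nt, coupling_efficiency):
    """Fraction of strands reaching full length.

    Assumes length_nt - 1 independent coupling steps, each succeeding
    with probability coupling_efficiency (the first base is pre-attached).
    """
    return coupling_efficiency ** (length_nt - 1)


def max_length_for_yield(min_fraction, coupling_efficiency):
    """Longest strand whose full-length fraction stays at or above min_fraction."""
    steps = math.log(min_fraction) / math.log(coupling_efficiency)
    return int(math.floor(steps)) + 1


for efficiency in (0.995, 0.999):
    for length in (200, 500, 5000):
        fraction = full_length_fraction(length, efficiency)
        print(f"efficiency {efficiency:.3f}, {length:>4} nt: {fraction:.2%} full length")
```

Under these assumptions a 200-nucleotide oligo made at 99.5% per-step efficiency is roughly 37% full length, which is why multikilobase direct synthesis would need per-step efficiencies well beyond those values.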

To support the scale and quality of DNA 
required for genome-scale projects, enhance- 
ment of electromechanical systems as well 
as innovative biological tools are needed. 
Continued increase in throughput may be 
achieved by further parallelization and min- 
iaturization—for example, by semiconductor 
fabrication or droplet-based techniques. In- 
creases in DNA quality and production speed 
will be influenced by use of biological tools 
such as enzymes and organisms. 


GENOME EDITING 

Powerful new DNA editing tools have low- 
ered the technical barriers for performing 
highly precise genetic and epigenetic modi- 
fications. Multiplexed editing of an intact 
genome could sharply decrease the time 
and labor required to generate a large num- 
ber of modifications and, in some instances, 
circumvent the need for de novo synthesis 
and chromosome assembly. 

Yet, despite considerable success with pro- 
grammable nucleases such as Cas9, TALEN, 
and ZFN in multiple cell types with exqui- 
site temporal and tissue-specific control, 
genome-scale editing remains limited. A 
localized, nuclease-induced double-strand 
break can be used to increase editing effi- 
ciency at each locus, but multiple simultane- 
ous breaks often cause cellular toxicity. To 
avert toxicity, “base editor” enzymes were 
engineered in which the nuclease is replaced 
by enzymatic base modification (9), achiev- 
ing simultaneous editing of over 13,000 Alu 
repetitive elements in human cells by using
a small number of guides (10). Other engineered
Cas9 tools are used for repression,
activation, or targeted insertion of DNA. 
However, a major bottleneck for multiplex 
genome-wide editing remains the delivery 
of guide RNAs (gRNAs), as multiple unique 
genome changes necessitate the presence of 
multiple unique gRNAs in the same cell. We 
anticipate that this barrier, as well as off-tar- 
get mutagenesis and constraints caused by 
sequence specificity of editing enzymes [e.g., 
protospacer adjacent motif (PAM) sequence 
requirement], will be overcome to enable 
routine multiplex editing. 

Genome-scale editing can also be accomplished
by oligo recombineering (11), which
relies on homologous recombination (HR) 
and has reduced in vivo toxicity. However, 
this technique is currently limited to a hand- 
ful of organisms in which high-efficiency 
HR can be catalyzed by a recombinase that 
uses a donor ssDNA for targeting. To enable 
recombineering to edit plants and mamma- 
lian cells, new recombinase enzymes must 
be discovered or designed. It may also be 
necessary to map and modulate an organ- 
ism’s repair pathways to improve HR. 

Additionally, comprehensive suites of 
molecular tools should be generated to 
accelerate testing and optimization of ge- 
nome editing. For example, programmable 
TALEN or ZFN nucleases can be generated 
for targeting all UAG stop codons in human 
cells. Similarly, CRISPR-Cas9 guide librar- 
ies targeting all PAMs can be used to ex- 
plore multiplexed, allele-specific targeting 
in plant, human, or fungal cells. Efforts to 
generate these genome-wide resources will 
provide experimental evidence of “accessi- 
bility maps” that reflect editing efficiency 
variability across genomic targets. Such 
data will optimize the choice of target se- 
quences, inform predictive computational 
models, and deepen our knowledge of chro- 
mosome structure, folding, and repair. 
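
As a rough illustration of how a genome-wide guide library might be enumerated, the sketch below lists every candidate protospacer that sits next to an NGG motif on either strand. The NGG motif is the PAM recognized by the widely used Streptococcus pyogenes Cas9. The function names, the 20-nucleotide spacer length default, and the toy sequence are assumptions for illustration. The code does no off-target or efficiency scoring.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_ngg_sites(seq, spacer_len=20):
    """List (strand, start, protospacer) for every NGG PAM with a full spacer.

    `start` is the 0-based position of the protospacer on the scanned
    strand; for "-" it refers to the reverse-complemented sequence.
    A lookahead is used so that overlapping sites are all reported.
    """
    pattern = re.compile(r"(?=([ACGT]{%d})[ACGT]GG)" % spacer_len)
    sites = []
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for match in pattern.finditer(strand_seq):
            sites.append((strand, match.start(), match.group(1)))
    return sites


toy = "A" * 20 + "TGG"                 # one site on the forward strand
print(find_ngg_sites(toy))
```

Measured editing efficiencies for each enumerated site would be the raw material for the "accessibility maps" described above.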


CHROMOSOME CONSTRUCTION 
The most critical hurdle facing synthetic ge- 
nomics is the assembly and introduction of 
synthetic chromosomes into host cells. How 
does one stitch together all the DNA pieces 
required to construct a fully functional chro- 
mosome? Once constructed, how can we 
control chromosome localization and archi- 
tecture to ensure cell viability? How do we 
replace all chromosome copies in polyploid 
organisms? As the genomes of most free-liv- 
ing organisms are larger than 2 Mb, methods 
for routine manipulation of large DNA frag- 
ments are critically needed. 

Despite recent improvements in DNA 
synthesis and in vitro cloning, such methods
are not efficient for construction of entire
chromosomes. Higher-order assembly
of chromosomes at least 1 Mb in length can 
be performed by in vivo HR in the yeast Sac- 
charomyces cerevisiae—a robust technique 
used in all synthetic chromosomes reported 
to date, including viral, bacterial, yeast, and 
algal chromosomes, as well as fragments of 
mouse and human genomes (12, 13).

The efficiency of DNA assembly in S. cere- 
visiae has not been found in other genetically 
tractable organisms. To expand the toolkit for 
writing specialized synthetic chromosomes 
that are difficult to assemble in S. cerevisiae, 
new HR-proficient cloning organisms should 
be developed that tolerate high GC, direct re- 
peats, and desired posttranscriptional modifi- 
cations. Organisms compatible with extreme 
environments such as desert, deep ocean, or 
space travel may provide new routes for DNA 
assembly, such as found in the polyextremo- 
phile Deinococcus radiodurans. 

Once a chromosome is constructed, deliv- 
ery and manipulation become the primary 
engineering bottleneck in most desired hosts. 
To deliver megabase-scale constructs, ro- 
bust, high-throughput DNA transformation 
methods must be developed in a variety of 
organisms spanning genera. For example, 
breakthroughs in DNA delivery can revolu- 
tionize plant engineering, which is currently 
hindered by species-specific, labor-intensive 
transformation methods and limited by tra- 
ditionally conservative funding. High-risk, 
high-reward funding to support moderniza- 
tion of plant research, such as development 
of tissue-culture-independent DNA delivery 
methods, is pertinent for synthetic-biology-enabled
improvements of agricultural organisms.
Automation of specialized methods
for chromosome transfer between yeast,
bacteria, plants, and mammalian cells (such 
as cell fusion, genome transplantation, or 
microinjection) requires multidisciplinary 
funding opportunities aimed at bridging 
microfluidics with traditional cell and mo- 
lecular biology work. It is essential that early 
proof-of-concept efforts be supported at gov- 
ernment and foundation levels. 

Many cellular forces that shape genome 
structure and function remain largely un- 
known. Fundamental studies are needed to 
elucidate mechanisms by which sequence 
and epigenetic regulation guide inter- and 
intrachromosomal interactions and de- 
termine genome architecture. Emerging 
technologies for programmable modifi- 
cations of chromatin, such as insulators 
guiding chromatin remodeling, safe harbor 
sites for DNA insertion, and orthogonal re- 
combinase enzymes, will be necessary for 
developing gene therapies (14). Better understanding
of organelle genomes (plastid,
mitochondrion), which remain extremely 
difficult to engineer, would offer new routes 
for stable maintenance and incorporation
of artificial chromosomes. 

A final challenge when introducing syn- 
thetic constructs is to quickly determine 
whether they perform as desired in the desti- 
nation cell. Whole-genome DNA and RNA se- 
quencing will serve as a first-pass verification 
of chromosome integrity. Tailored cell lines 
with phenotypic reporters may be developed 
for assessing the performance of large syn- 
thetic constructs. Reliable organoid models 
and a clear understanding of regulation and 
expression changes that drive organismal de- 
velopment will be key to extrapolating results 
from single cells to the design of functional 
chromosomes for multicellular organisms. 


GLOBAL AND MULTIDISCIPLINARY 

New technologies may come from efforts in 
synthetic biochemistry, such as program- 
mable synthetic protocells, from progress at 
the interface of hardware and wetware such 
as solid-phase DNA assembly platforms, or 
from findings in basic bioscience research— 
for example, by uncovering valuable new 
enzymes or delivery systems. Innovation will 
be driven by government grants and genomic 
and cancer institutes, with a growing role for 
BioFoundries, emerging hubs for automation 
of bioengineering. A highly interdisciplin- 
ary, multinational effort from government 
and private sectors will help achieve and dis- 
seminate these advances to make an impact 
in biomedical, pharmaceutical, agricultural, 
and chemical industries. 
ENVIRONMENTAL STUDIES


The work of 
words in the 
Anthropocene 


Two books—one fictional, 
one dictionary—probe 
how we conceive of our 
changing world 


By Deborah Dixon 


The books reviewed here dwell on the
work of words in an “Anthropocene,” 
a term itself critiqued as hubris-filled, 
which glosses the question of who is 
responsible for the forcing of physical 
processes and the role of the nonhu- 
man in transforming a habitable planet. 
Both come from an arts and humanities 
tradition that worries at how we 
make sense of ourselves, the world, 
and our place within it—concerned 
that such contemplations are always 
partial, relational, and unfinished. 

A growing environmental arts and 
humanities literature and practice 
has emphasized the emotional and 
psychological impacts of an Anthro- 
pocene that foregrounds a series of 
endings—from the extinction of spe- 
cies to the curtailing of life on Earth— 
as well as a series of challenges, from 
resilience to survival. What words do 
we use to describe the conditions we 
live in, the challenges we face, and 
the futures we want? 

An Ecotopian Lexicon—“ecotop- 
ian” meaning out of place—offers 
a fascinating collection of non- 
English or newly invented words 
that impart something of the com- 
plexities of everyday life in an era 
of warming skies and oceans, mass 
degradation, precarity, and insecu- 
rity, each of which also helps map 
a possible future. The stated work 
of words here is to clarify, diagnose, 
and stimulate action. 


The reviewer is at the School of Geographical and 
Some of the terms are already circulating
through academia and popular media.
These include “solastalgia,” coined by
Glenn Albrecht, which connotes a feeling
of powerlessness and grief when faced with
the sweeping change of a landscape that
is still lived in, and “heyiya,” from Ursula
Le Guin, which means to turn away from
ecocidal practices. Some speak to
an experience of place or its physical
transformation (“godhuli” being
a particularly evocative example,
meaning refracted sunlight seen
through the haze of dust kicked up by
cows as they seek shelter). Some hint
at earthly forces and cosmologies
(such as “qi,” meaning a self-balancing
order within the Universe that
connects all and subsumes chaos).
One greeting, written as “~*~,” connotes
the feel of a light breath of air
blown over the back of your hand; it
is borrowed from dolphin societies to
convey something of the electronic vibrations
in which our bodies become enrolled.

The word “godhuli” refers to the Sun, as seen through dust kicked by cows.

An Ecotopian Lexicon
[...] Mayerson and [...] Bellamy, Eds.
University of [...]

Extinction Events
Liz Breazeale
University of
Nebraska Press,
2019. 131 pp.

This is a book that wants to stir passions,
which in turn become a means of realizing
desired futures. As such, it resonates with
the Twitter work of so many climate scientists,
which hinges on the relaying of dire
facts in tones that urge action. The lexicon’s
“borrowing” of words, although mindful of a
continued coloniality, nevertheless presents
a welter of experiences, felt and articulated.

The words of Extinction Events, although
similarly situated in “endings,” do a different
kind of work. These words do not
offer a map, nor do they seek to stir passions
toward the future. Rather, they evoke,
somewhat in the vein of Italo Calvino’s
The Complete Cosmicomics (1),
a planetary history that winds its
way tightly into the lives of fictional
women who face unsettling losses,
from the disappearance of islands to
the erosion of marital bonds.

What survives an extinction
event? What emotional and psychical,
as well as physical, traces
remain? What does it feel like to
anticipate cataclysm? And to experience
the falling apart of maps that
can no longer situate and orientate
us? Breazeale’s stories press time
and again on these questions, as the flesh 
of mother and daughter merges with the 
ash of volcanoes (“Ashcake”) and a memory 
of a kiss retains its Earthiness, even as it 
becomes a molecular cloud in the cosmos 
(“The Supernova of Irvin Edwards”). 

At times, the subsuming of a human lively 
time by the “deep time” of the Earth is a 
little heavy-handed, but the best 
of this collection is profoundly affective.
“Devil’s Tooth Museum,”
for example, is a tour de force that 
ostensibly hinges on an effort to 
reopen an impact crater exhibition 
and museum center after the death 
of the protagonist’s sister, Syl. Syl’s 
presence haunts the site, prompt- 
ing the girls’ grandfather to further 
the physical breakdown of the ex- 
hibits, even as the narrator strives 
to mend them. In the dusty, Lysol- 
scented atmosphere, the violent 
extinctions wrought by the impact 
merge with the final breakdown of 
a simulator and the loss of a sib- 
ling: “A cataclysm, the air vibrating. 
In the panels, across the glass cases, 
the swarm of colors perforating the 
dark and in the tremors Syl, her im- 
print a shattered cone, a remnant, 
a brokenness that inhabited the in- 
nermost hollows of the Earth.” 

If An Ecotopian Lexicon makes 
futures with words, Extinction 
Events dwells on all of us as future 
fossils in the making.
The stories that make us spend (and save)


A Nobel laureate reveals how narratives shape the economy 


By Sonia Jaffe 


Where did boycotts originate? Who
were the first Luddites? When were
rolling suitcases invented? Stories
such as these have traditionally
garnered less attention from economists,
who tend to focus on more
easily quantified phenomena, than from
anthropologists, sociologists, or psychologists.
Robert Shiller wants to change that. 

A Nobel laureate for his pioneering re- 
search on financial markets, Shiller argues 
in his new book, Narrative Economics, that 
the stories people tell can affect or even cause 
major economic events and therefore merit 
more attention from economists. 
By “narrative,” he refers to stories— 
and their underlying ideas or senti- 
ments—that are transmitted through 
“word-of-mouth contagion,” includ- 
ing print, digital, and social media. 

An analogy to epidemic dynamics 
runs throughout the book. Shiller 
references research from fields as 
diverse as marketing and neurosci- 
ence to describe how a narrative’s 
rate of “contagion” and “recovery” 
(how soon it is forgotten) are af- 
fected by human interest, celebrity, 
and a story’s visual imagery or abil- 
ity to evoke intense emotion. Conta- 
gion and recovery rates, he argues, 
affect the prevalence and durabil- 
ity—the virality—of the associated 
narrative. Narratives “evolve” over 
time, potentially changing in their 
contagiousness and their effect on 
people’s behavior. “Mutations” arise 
when, for example, a slight change in word- 
ing or an association with a celebrity changes 
how contagious a narrative is; a small, ran- 
dom event can have compounding effects 
that lead to disproportionate influence. 

Economic narratives—those that poten- 
tially affect people’s economic behavior— 
frequently include economic decisions that 
individuals who hear the story want to emu- 
late. Shiller begins with the timely example 
of Bitcoin: If you hear that those who bought 
Bitcoin early got wealthy and its price is increasing,
you will probably want to buy Bitcoin.
Such narratives are often self-fulfilling
prophecies. Others may simply be false, with
more accurate versions or explicit correc- 
tions attracting far less attention. True or 
false, Shiller argues that narratives can com- 
plement or reinforce each other and that a 
constellation of narratives sometimes affects 
events in ways that no single narrative would. 

The bulk of Shiller’s discussion delves into 
nine narratives—or pairs of opposing narra- 
tives—that he identifies as having played im- 
portant roles in the economy over the past 
150 years, mainly of the United States and 
Europe. They range from concerns about 
technology eliminating jobs to tales of “evil 
businesses” raising prices to the competing 


When we hear early Bitcoin investors got wealthy, we want to buy Bitcoin. 


norms of conspicuous consumption and 
modest frugality. For each narrative, Shiller 
analyzes the rise and fall of salient phrases 
on Google Ngrams and in the ProQuest News 
and Newspapers database, but he focuses on 
the quotes and anecdotes that embodied the 
narratives at different points in history. The 
big events, such as the Great Depression and 
the Great Recession, will be familiar to most 
readers—the dozen other contractions and 
depressions since 1850 perhaps less so. For 
events big and small, the narratives Shiller 
presents based on contemporaneous pri- 
mary sources are much more engaging than 
the employment, gross domestic product, 
and stock market numbers that economists 
generally use to describe them. 
In addition to urging researchers to
analyze the vast data on digital com- 
munication of narratives, Shiller advo- 
cates digitizing and assembling existing 
texts from the past and actively collect- 
ing data—through interviews and focus 
groups—about narratives of the present. 
He hopes that understanding the influence 
of narratives will allow leaders 
to “create and disseminate coun- 
ternarratives that establish more 
rational and more public-spirited 
economic behavior.” He does not 
discuss the resemblance that such 
an intervention would have to gov- 
ernment propaganda, although he 
does acknowledge the partisan as- 
sociations of many narratives and 
the resulting challenges to objec- 
tive research. 

Shiller’s thorough discussion 
and many examples are certainly 
convincing as to the importance of 
narratives in individual economic 
decision-making and aggregate eco- 
nomic phenomena. However, most 
economists will likely want to focus 
on distinguishing narratives that 
cause economic events from those 
that are merely correlated with, 
or caused by, those events. Shiller 
acknowledges this challenge. Although 
the contagion process is “in many ways a 
random event,” there is no controlled ran- 
domization of narratives. Moreover, “big 
economic events usually can’t be described 
as caused by just a single constellation of 
narratives”; the interaction of competing or 
complementary narratives is an additional 
challenge for identifying causality. 

Nevertheless, Shiller is optimistic about 
the potential for researchers to combine the 
growing quantities of available data with 
improvements in semantic analysis to pro- 
vide the context necessary for distinguish- 
ing narratives’ causes from their effects. 


10.1126/science.aaz2690 
Forest restoration:
Overlooked constraints 


In their Report “The global tree restora- 
tion potential” (5 July, p. 76), J.-F. Bastin 
et al. use machine learning to derive the 
carbon storage potential of global tree 
restoration, which they identify as the 
most effective climate change mitigation 
option. However, the study likely overesti- 
mates the actual potential by identifying 
opportunities for increasing canopy cover 
in environments with obvious environ- 
mental or socioeconomic constraints. 

In high-latitude regions of Russia, 
Scandinavia, and North America, permafrost
and short growing seasons (1) impair
tree growth. In large parts of Australia 
and other arid and hyperarid regions, 
salinity, sodicity, hardpans, and moisture 
limitations prevent tree establishment (2, 
3). In African grasslands, infertile soils, 
grazing animals, water constraints, and 
wildfires maintain patchy shrub-grass 
environments (4). In areas with severely 
degraded soils and biodiversity loss in the 
Americas and in Asia (5, 6), prospects of 
restoring pre-degradation canopy cover 
are limited. In grazing lands and produc- 
tion forests, abandoning current uses 
implies staggering absolute opportunity 
costs. Finally, Bastin et al. excluded areas 
classified as urban, but the data set they 
used (7) fails to recognize some major 
urban centers and many towns and villages
in rural areas (7); more than 2.5 billion
people live in areas that Bastin et al. consid- 
ered eligible for restoration (8), including 
entire cities, such as Kinshasa, the capital of 
the Democratic Republic of Congo. 

Bastin et al. introduced further over- 
estimation by multiplying tree cover 
expansion potential by total ecosystem 
carbon. This operation lowers the baseline 
by assuming that carbon stock is propor- 
tionally related to canopy cover—i.e., that 
land with no trees contains no carbon. 
The use of biome-level carbon stock 
averages, without considering spatial 
variation, also adds considerable error, 
especially in alleged high-potential areas, 
where these averages (154.7 to 282.5 Mg 
ha⁻¹) are approximately 5 times greater 
than what has been reported in site-spe- 
cific assessments (9, 10). 

We appreciate the need for benchmark 
estimates of carbon storage and restora- 
tion potentials, but realistic predictions 
require tapping expert knowledge to 
ensure relevant constraints are consid- 
ered, as well as more rigorous quality 
control, such as mapping how model 
validation errors are spatially distrib- 
uted. Overly hopeful figures produced by 
models without necessary supervision 
may misguide the development of climate 
policy (11, 12). 
Forest restoration: 
Expanding agriculture 


In their Report “The global tree restora- 
tion potential” (5 July, p. 76), J.-F. Bastin 
et al. determine the available potential 
forest restoration area by excluding areas 
with existing trees, urban settlement, and 
cropland. However, they overestimate the 
potential area because they do not account 
for projected agricultural land expansion 
or current use of pasture land. 

There is evidence from satellite imag- 
ery that most of global agricultural land 
expansion in the previous three decades 
happened and is still happening on 
tropical forest land, especially in Brazil and 
Southeast Asia (1-3). Given that this trend 
is likely to continue, especially in the highly 
productive areas in Central and South 
America, agricultural land expansion must 
be taken into account when assessing future 
tree restoration potentials (4-6). Food and 
Agriculture Organization projections expect 
an increase of cropland by 7% until 2030 
(7), and evidence suggests an increase in 
global cropland area between 11 and 26% 
until 2050 (8), the latter corresponding to 
4 million km². Based on one approach (4), 
not using this area for crop production 
would reduce global crop production by 
11% and increase crop prices by 23%. 

Furthermore, Bastin et al. assume that 
grassland would be available for tree 
restoration. They choose to ignore the data 
showing that currently about 30 million 
km² of grassland areas are used for exten- 
sive livestock production (9). Not utilizing 
areas for cropland expansion and pasture 
land requires a higher intensification of 
agriculture, which in turn is associated 
with higher agricultural emissions (10) 
and loss of biodiversity (4). 

Bastin et al. do not consider current and 
future trade-offs with food security and 
neglect socioeconomic aspects of increasing 
consumption that arise through population 
growth, income growth, and preference 
changes toward more livestock products 
in fast-growing economies (11). Excluding 
estimated expansion areas and grazing 
land reduces the calculated sequestration 
potential by 19 and 57%, respectively, when 
applying the carbon densities of the book- 
keeping model BLUE (12). 
Forest restoration: 
Transformative trees 


We welcome the attention given to forest 
and trees by the Report “The global tree 
restoration potential” (5 July, p. 76), in which 
J.-F. Bastin et al. study the potential of tree 
cover to reduce climate change. However, we 
are concerned by their neglect of the water 
cycle. They consider how water influences 
tree cover but disregard how tree cover 
influences water. Bastin et al. recognize that 
their extrapolations are not “future projec- 
tions of potential forest extent” but instead 
represent potential tree cover “under exist- 
ing environmental conditions.” However, 


Expanding agricultural land and livestock production could conflict with forest restoration goals. 


given the influence of forests on their 
environment, the concept of potential tree 
cover under current conditions is problem- 
atic. Trees influence several of the variables 
Bastin et al. used to model tree cover, 
including precipitation quantity, variability, 
and seasonality, as well as soil moisture and 
atmospheric water transport (1-4). 

While much remains uncertain (2), we 
know enough to foresee that afforestation 
and reforestation have potential for both 
negative and positive hydrological impacts. 
Negative impacts can result if plantings 
deplete groundwater and thus exacerbate 
local water scarcity. Changes can manifest 
quickly and are a recognized problem with 
fast-growing monoculture plantations (5). 
Positive impacts can result when tree cover 
improves soil and groundwater recharge 
and storage, such as through suitable spe- 
cies and tree densities (6). Forest cover can 
also promote rainfall recycling and thus 
bolster and stabilize regional and downwind 
rainfall (1, 7, 8). In suitable circumstances, 
increased forest cover may even return 
wetter climates to currently drier regions, 
expanding the land available for trees (2). 
These outcomes have profound implica- 
tions given that reliable access to water is 
central to achieving the UN Sustainable 
Development Goals. Accounting for the 
potentially transformative power of trees 
for both water and carbon offers crucial 
constraints as well as vast benefits. 
Response 


Luedeling and colleagues argue that we 
have overestimated the restoration capac- 
ity in several regions of the world. Our 
model predicts the expected optimal tree 
cover from a combination of 10 envi- 
ronmental variables that were selected 
through a variable selection procedure 
to avoid overfitting issues. As detailed in 
table S1 of our supplementary material, 
these 10 variables include mean annual 
temperature, temperature of the wettest 
quarter, annual precipitation, precipi- 
tation seasonality, precipitation of the 
driest quarter, elevation, hillshade, soil 
organic carbon, sand content, and depth 
to bedrock. These ecological variables 
cover average and seasonal variation in 
climate and variation in topographic and 
edaphic conditions. As such, we have done 
everything that is possible to represent all 
of the conditions raised by Luedeling and 
colleagues. Of course, cold and dry condi- 
tions are among the main limitations for 
tree growth, and that is why we have rep- 
resented these environmental constraints 
in our model to ensure that we do not 
predict that trees can exist in regions that 
are too cold or dry. 

As explained in the main text, our 
rigorous k-fold cross-validation (fig. S4A) 
revealed that our model could explain 
about 71% of the variation in tree cover 
without bias (fig. S3, B and C). This means 
that our model is unbiased at a global 
scale, but we do not explain 100% of the 
potential tree cover variation. It is conse- 
quently possible to find places where we 
overestimate or underestimate the poten- 
tial tree cover—particularly in areas where 
uncertainties are high, as shown in fig. S6. 

Delzeit and colleagues claim that we 
overestimate the area available for tree 
restoration because the expansion of crop- 
lands in upcoming decades will reduce the 
land available for restoration, and because 
pasture lands are considered as potential 
land for restoration in our assessment. 
We agree that, if we continue to expand 
agricultural land area, there will be a 
reduction in the land available for res- 
toration. As stated in our analysis, our 
model estimates the area that is currently 
available for restoration under present 
conditions. Of course, any changes in the 
area of land use will necessarily affect this 
global total. We exemplified this in our 
attempts to show how future changes in 
climate might reduce the area available 
for restoration. We hope that our analy- 
sis can also serve as a stepping stone for 
future research to evaluate how changes 
in agricultural land use will affect the 
potential restoration area. 

It is true that we included rangelands 
in the area available for restoration. Of 
course, much of this land is used for the 
grazing of animals and so may not be 
available for complete forest restoration. 
However, as mentioned in the Report, 
several studies suggest that it is possible 
to increase the current tree cover in these 
areas without limiting food production 
(1, 2), especially when forest cover is rela- 
tively low, as is the case for most of the 
pasture land in our model. 

Because we removed all urban and 
agricultural land (i.e., we considered a 
potential increase of tree cover of 0% in 
cropland and urban areas), our numbers are 
likely to underestimate the total area that 
could currently be covered by trees. Indeed, 
both croplands and cities constitute great 
opportunities to increase the current tree 
cover and to play a major role in mitigat- 
ing climate change (3-5). We maintain that 
our global estimate of the land available 
for restoration is a conservative one, and 
we encourage local land owners to use our 
forest restoration potential map in combina- 
tion with more detailed local-scale estimates 
of land use when designing effective resto- 
ration strategies. 

Sheil and colleagues point out that 
restoring ecosystems might have either 
positive or negative consequences regard- 
ing hydrology. We agree that these effects 
must be considered as a priority in 
upcoming research in restoration ecology. 
TECHNICAL COMMENT ABSTRACTS 
Comment on “The global tree restoration potential” 


Pierre Friedlingstein, Myles Allen, Josep G. 
Canadell, Glen P. Peters, Sonia I. Seneviratne 
Bastin et al. (Reports, 5 July 2019, p. 76) 
claim that global tree restoration is the most 
effective climate change solution to date, 
with a reported carbon storage potential 
of 205 gigatonnes of carbon. However, this 
estimate and its implications for climate miti- 
gation are inconsistent with the dynamics of 
the global carbon cycle and its response to 
anthropogenic carbon dioxide emissions. 
Full text: dx.doi.org/10.1126/science.aay8060 


Comment on “The global tree restoration potential” 


Joseph W. Veldman, Julie C. Aleman, Swanni T. 
Alvarado, T. Michael Anderson, Sally Archibald, 
William J. Bond, Thomas W. Boutton, Nina 
Buchmann, Elise Buisson, Josep G. Canadell, 
Michele de Sa Dechoum, Milton H. Diaz-Toribio, 
Giselda Durigan, John J. Ewel, G. Wilson Fernandes, 
Alessandra Fidelis, Forrest Fleischman, Stephen 
P. Good, Daniel M. Griffith, Julia-Maria Hermann, 
William A. Hoffmann, Soizig Le Stradic, Caroline 
E. R. Lehmann, Gregory Mahy, Ashish N. Nerlekar, 
Jesse B. Nippert, Reed F. Noss, Colin P. Osborne, 
Gerhard E. Overbeck, Catherine L. Parr, Juli G. 
Pausas, R. Toby Pennington, Michael P. Perring, 
Francis E. Putz, Jayashree Ratnam, Mahesh 
Sankaran, Isabel B. Schmidt, Christine B. Schmitt, 
Fernando A. O. Silveira, A. Carla Staver, Nicola 
Stevens, Christopher Still, Caroline A. E. 
Stromberg, Vicky M. Temperton, J. Morgan Varner, 
Nicholas P. Zaloumis 

Bastin et al.'s estimate (Reports, 5 July 
2019, p. 76) that tree planting for climate 
change mitigation could sequester 205 
gigatonnes of carbon is approximately 
five times too large. Their analysis inflated 
soil organic carbon gains, failed to safe- 
guard against warming from trees at high 
latitudes and elevations, and considered 
afforestation of savannas, grasslands, and 
shrublands to be restoration. 

Full text: dx.doi.org/10.1126/science.aay7976 


Comment on “The global tree restoration potential” 


Simon L. Lewis, Edward T. A. Mitchard, Colin 
Prentice, Mark Maslin, Ben Poulter 

Bastin et al. (Reports, 5 July 2019, p. 76) 
state that the restoration potential of 
new forests globally is 205 gigatonnes of 
carbon, conclude that “global tree restora- 
tion [is] our most effective climate change 
solution to date,” and state that climate 
change will drive the loss of 450 million 
hectares of existing tropical forest by 2050. 
Here we show that these three statements 
are incorrect. 

Full text: dx.doi.org/10.1126/science.aaz0388 


Response to Comments on “The global tree 
restoration potential” 


Jean-Francois Bastin, Yelena Finegold, Claude 
Garcia, Nick Gellie, Andrew Lowe, Danilo Mollicone, 
Marcelo Rezende, Devin Routh, Moctar Sacande, Ben 
Sparrow, Constantin M. Zohner, Thomas W. Crowther 
Our study quantified the global tree 
restoration potential and its associated 
carbon storage potential under existing 
climate conditions. We received multiple 
technical comments, both supporting and 
disputing our findings. We recognize that 
several issues raised in these comments are 
worthy of discussion. We therefore provide 
a detailed common answer where we show 
that our original estimations are accurate. 
Full text: dx.doi.org/10.1126/science.aay8108 


Comment on “The global tree restoration potential” 


Alan Grainger, Louis R. Iverson, Gregg H. Marland, 
Anantha Prasad 

Bastin et al. (Reports, 5 July 2019, p. 76) 
neglect considerable research into forest- 
based climate change mitigation during the 
1980s and 1990s. This research supports 
some of their findings on the area of land 
technically suitable for expanding tree cover 
and can be used to extend their analysis to 
include the area of actually available land 
and operational feasibility. 

Full text: dx.doi.org/10.1126/science.aay8334 
EXOPLANETS 
Peering inside extrasolar rocky bodies 


The oxygen fugacity of a rock, fO₂, is a measure of how oxidizing or 
reducing its surroundings were when the rock formed. Different minerals 
form at different fO₂ and have different physical properties, so the 
internal structure of an exoplanet depends on this value. Doyle et al. 
exploited the signature left behind when rocky bodies impact a white 
dwarf, the remnant of a dead star. By examining the rock-forming 
elements left on the surface of each white dwarf, they determine fO₂ in 
the impacting body. Six systems all had similar fO₂ to bodies in the 
Solar System, consistent with the idea that rocky exoplanets often have internal 
properties similar to those of Earth and Mars. —KTS 


Science, this issue p. 356 


Artist's vision of a white dwarf stripping a planet of its rocky outer layers 


CELL BIOLOGY 
Origins of collective 
contraction 


In contrast to plants and fungi, 
animals can deform their bod- 
ies by the collective activity 
of contractile cells. Collective 
contractility underlies pro- 
cesses such as gastrulation and 
muscle-based motility. Brunet 
et al. report that a close relative 
of animals, a choanoflagellate 
they name Choanoeca flexa, 
forms cup-shaped colonies that 
undergo collective contractil- 
ity, leading to a rapid change 
in colony morphology (see the 
Perspective by Tomancak). 
C. flexa colonies are each com- 
posed of a monolayer of polarized 
cells. In response to sudden 
darkness, a light-sensing protein 
triggers coordinated, polarized 
contraction of C. flexa cells, which 
results in colony inversion. The 
cellular mechanisms underly- 
ing this process are conserved 
between C. flexa and animals, 
indicating that their last common 
ancestor was also capable of 
polarized cell contraction. —BAP 
Science, this issue p. 326; 
see also p. 300 


3D PRINTING 
Large-scale, 
continuous 3D printing 


Many three-dimensional 
(3D) printing methods build 
up structures layer by layer, 
which causes a lamination 
layer between each discrete 
step. Continuous printing can 
be done from a fluid bed if a 
so-called dead layer is used to 
buffer between the solidified 
structure and pool of resin. 
However, printing speeds are 
limited by the heat buildup from 
the exothermic polymerization 
process, thus limiting the ulti- 
mate size of the printed object. 
Walker et al. use a pumped, 
nonreactive fluorinated oil to act 
as the dead layer that removes 
heat during polymerization. 
This approach allows for both 
speedup and scale-up of the 
printing process. —MSL 

Science, this issue p. 360 


Thermal imaging of a 3D-printed lattice 




EARTHQUAKES 
Many ruptures across 
many scales 


The period of seismic quies- 
cence in Southern California 
was rudely interrupted by 
the Ridgecrest earthquake 
sequence in July 2019. Ross et 
al. mapped the slip sequences 
during the magnitude 6.4 and 
7.1 earthquakes that shook the 
region. They found that ruptures 
of a few larger, but many smaller, 
faults occurred during both 
earthquakes. The Ridgecrest 
sequence calls for rethinking 
seismic hazard, as multifault 
ruptures are not usually consid- 
ered when assessing seismic 
risk. —BG 

Science, this issue p. 346 


QUANTUM COMPUTING 
Generating large-scale 
cluster states 


The development of a practical 
quantum computer requires 
universality, scalability, and 
fault tolerance. Although much 
progress is being made in circuit 
platforms in which arrays of 
qubits are addressed and manip- 
ulated individually, scale-up of 
such systems is experimentally 
challenging. Asavanant et al. and 
Larsen et al. explore an alterna- 
tive route: measurement-based 
quantum computation, which is 
a platform based on the genera- 
tion of large-scale cluster states. 
As these are optically prepared 
and easier to handle (one simply 
performs local measurements 
on each individual component 
of the cluster state), such a 
platform is readily scalable 
and fault tolerant. The topol- 
ogy of the cluster state ensures 
that the approach meets the 
requirements for quantum com- 
putation. —ISO 

Science, this issue p. 373, p. 369 


GENE EXPRESSION 
A statistical model 
to find disease genes 


Genetic variation is high among 
individuals, which makes it 
difficult to identify any one 
specific pathogenetic variant in 
patients with idiopathic disease, 
especially those that are in non- 
coding regions of the genome. 
Examining tissue-specific and 
population-level RNA sequenc- 
ing data, Mohammadi et al. 
developed a statistical test, 
analysis of expression varia- 
tion (ANEVA), that can quantify 
how one individual's gene 
expression fits in the context of 
the variation within the general 
population. By applying ANEVA 
to a dosage outlier test, the 
authors identified pathogenic 
gene transcripts in patients with 
Mendelian muscle dystrophy. 
—LMZ 

Science, this issue p. 351 


SLEEP 
Need for sleep 


Sleep is crucial for healthy living 
and well-being, but individual 
sleep needs vary greatly. Xing 
et al. performed whole exome 
sequencing in a family of short 
sleepers and identified a point 
mutation in the neuropeptide 
S receptor 1 gene (NPSR1) 
responsible for the short sleep 
phenotype. The mutation 
increased receptor sensitiv- 
ity to the endogenous ligand. 
Mice carrying the mutation 
showed increased mobility time, 
reduced sleep duration, and 
resistance to sleep deprivation— 
induced cognitive impairment. 
NPSR1 may thus play a major 
role in sleep-related memory 
consolidation. -MM 


Sci. Transl. Med. 11, eaax2014 (2019). 


FIBROSIS 
Sequencing in the matrix 


Biological scaffolds that mimic 
tissue microenvironments 
can be used to model wound 
healing. Using single-cell RNA 
sequencing in two distinct 
biomaterial environments, 
Sommerfeld et al. found that 
defined populations of mac- 
rophages play a role in both 
fibrosis and regeneration. A 
subset of CD9⁺ interleukin- 
36γ (IL-36γ)-producing 
macrophages participates in 
IL-17-driven fibrosis. By 
evaluating wound healing in 
IL-17-deficient mice, they report 
IL-17 to be essential for the 
generation of CD9⁺ IL-36γ- 
producing macrophages dur- 
ing fibrosis. Further studies 
are needed to understand the 
functional relationship between 
IL-17- and IL-36γ-producing 
cells in fibrosis and in other set- 
tings. —AB 

Sci. Immunol. 4, eaax4783 (2019). 


IN OTHER JOURNALS 


MEDICAL REGULATION 


Edited by Caroline Ash 
and Jesse Smith 


The United States regulates, 
the European Union benefits 


Premarket testing of medical products can be costly but 
can also reduce uncertainty and improve public welfare. 
Grennan and Town studied prices and quantities for coronary 
stents implanted from 2004 to 2013 in many hospitals in 
the European Union (EU) (where only safety testing was 
required) and the United States (where both safety and effi- 
cacy testing were required). As products were undergoing 
U.S. clinical trials, volatility in their usage decreased and 
average usage increased in the EU. Neither usage 
pattern was seen in the EU for products not in 
U.S. trials, suggesting learning from U.S. trials. 
Although EU consumers gain some benefit 
from increased variety in products, there 
could be substantial gain from more 
premarket testing. —BW 


Am. Econ. Rev. www.aeaweb.org/articles?id=10.1257/aer.20180946 (2019). 


ENERGY STORAGE 
Ultra-low-temperature 
aqueous batteries 


The development of efficient 
electrochemical energy storage 
systems is of prime importance 
for future energy manage- 
ment, which will be increasingly 
based on renewable energy 
sources. Aqueous rechargeable 
alkali-ion batteries are par- 
ticularly attractive for large-scale 
Mandated efficacy 
testing of coronary stents 
performed in the United States 
increased their usage in the EU, 
where such tests are not required. 


implementations given their 
environmentally friendly 
components and low cost. 
However, their low-temperature 
performance requires improve- 
ments, which can be achieved by 
optimizing the electrolyte’s ionic 
conductivity and freezing point. 
Using dimethyl sulfoxide as the 
electrolyte additive, Nian et al. 
develop a new practical electro- 
chemical system with electrolyte 
that freezes below -130°C. As a 
Studies on the baboon, Papio anubis, show how adverse life events affect subsequent generations. 


LIFE HISTORY 


Predicting offspring life span 
Environmental and life choices affect health and reproduction. Several studies show that these 
effects extend to the next generation and sometimes beyond. Zipple et al. have examined a lon- 
gitudinal dataset of a natural population of baboons in southern Kenya. Individuals in this group 
can live up to 30 years. The dataset was collected over 45 years and documented life span and 
adversity experienced during the prereproductive life of females. Adverse events ranged from 
death of the mother when offspring were young, to surviving drought, to having siblings of close age 
or being born into a large social group. Individuals affected by adversity often outlived their offspring. 
These older mothers seemed less able to successfully rear their offspring, who commonly experi- 
enced high mortality in infancy even when free of adversity. -BAP 
eLife 2019, e47433 (2019). 


result, the battery operates with 
sufficient ionic conductivity even 
at -50°C. The reasons for such 
an ultra-low freezing point are 
analyzed by means of spectral 
characterization and molecular 
dynamics simulation. —YS 

Angew. Chem. Int. Ed. 10.1002/anie.201908913 (2019). 


IMMUNOLOGY 
Gallstones—a force 
of NETure? 


Gallstones are a painful and 
sometimes fatal condition that is 
one of the leading causes of hos- 
pitalization in the United States. 
They are thought to form when 
bile is concentrated and acidified 
in the gallbladder, leading to the 
supersaturation and subsequent 
precipitation of cholesterol and 
calcium salts. Muñoz et al. report 
that neutrophils release neutro- 
phil extracellular traps (NETs) 
after interacting with calcium or 
cholesterol crystals. These NETs 
contain extracellular DNA and 
elastase, which help fuel crystal- 
lization and gallstone assembly. 
Mice with defects in pathways 
required for NET formation 
were found to have gallstones of 
reduced size and number. One 
future therapeutic target for this 
condition may therefore lie in pre- 
venting neutrophils from casting 
their NETs. —STS 


Immunity 51, 443 (2019). 


PSYCHOLOGY 
Community-based 
policing 

Proponents of community-based 
policing argue that positive non- 
enforcement experiences with 
police enhance citizens’ trust 
and belief in the legitimacy of 
policing. Peyton et al. conducted 
a randomized field experi- 
ment to determine whether 
a single positive interaction 
could improve attitudes toward 
police. Police were trained in 
positive engagement strate- 
gies and assigned to talk with 
citizens for 10 minutes in a 
nonenforcement capacity. 
Surveys that were administered 
later found that citizens who 
had talked to officers held more 
positive attitudes toward the 
police 3 days and 21 days after 
those conversations compared 
with citizens who did not speak 
with an officer. These results 
have implications for improving 
relationships between citizens 
and police. —TSR 


Proc. Natl. Acad. Sci. U.S.A. 116, 
19894 (2019). 




MICROBIOLOGY 
Globe-trotting 
conjugal hitchhiker 


Plasmids transfer large gene 
clusters between distant bacte- 
rial relatives. They are notorious 
for spreading antibiotic resis- 
tance genes among pathogens. 
Petersen et al. wondered 
whether plasmid transfer 
among environmental bacteria 
is happening on a similar scale. 
Choosing Roseobacter, known 
for its genetic plasticity and 
importance in carbon and sulfur 
cycling in the oceans, the authors 
found a RepL-type plasmid 
called pLA6_12. This plasmid 
has 100% sequence identity in 
several genera from multiple 
locations. Consequently, its 
occurrence is indicative of recent 
global horizontal gene transfer 
events. pLA6_12 contains gene 
cassettes for detoxification of 
anthropogenic pollutants and, 
like antibiotic resistance plas- 
mids, seems to be selected for by 
human activities. —CA 

Proc. Natl. Acad. Sci. U.S.A. 116, 
20568 (2019). 


BIOCHEMISTRY 
Cooperating to 
control cadmium 


Cadmium is toxic to living 
organisms because it displaces 
essential metal ions in macro- 
molecules. Detoxification 
systems rely on sulfur-rich sites 
to coordinate the cadmium, but 
these sites also bind other heavy 
metals. Liu et al. show how the 
bacterium Pseudomonas putida 
eliminates cadmium. The protein 
CadR selectively responds to 
cadmium to bind to DNA and up- 
regulate transcription of itself and 
other cadmium detoxification 
proteins. The selectivity comes 
from cooperation between two 
types of binding sites. Cadmium 
first binds to a cysteine-rich site, 
and this stabilizes the second 
histidine-rich site in a conforma- 
tion that supports cadmium 
binding. Binding at both sites 
facilitates DNA binding to activate 
transcription. —VV 

Proc. Natl. Acad. Sci. U.S.A. 116, 
20398 (2019). 
SIGNALING 
Mediating systemic 
health 


Sphingosine 1-phosphate (S1P) 
is an important circulating lipid 
mediator that is derived from 
the metabolism of cell mem- 
branes. Its diverse homeostatic 
roles, particularly in immunol- 
ogy and vascular biology, can 
go awry in numerous diseases, 
including multiple sclerosis, 
cardiovascular diseases, and 
fibrosis. The centrality of S1P 
signaling has led to the develop- 
ment of several drugs, including 
two approved for treatment of 
multiple sclerosis. In a Review, 
Cartier and Hla discuss the 
current understanding of how 
one mediator can carry out so 
many signaling roles in different 
tissues, how these become dys- 
regulated in disease, and efforts 
in drug development to target 
S1P signaling. —GKA 

Science, this issue p. 323 


HUMAN EVOLUTION 
Adaptive archaic 
hominin genes 


As they migrated out of Africa 
and into Europe and Asia, 
anatomically modern humans 
interbred with archaic homi- 
nins, such as Neanderthals 
and Denisovans. The result of 
this genetic introgression on 
the recipient populations has 
been of considerable interest, 
especially in cases of selection 
for specific archaic genetic vari- 
ants. Hsieh et al. characterized 
adaptive structural variants and 
copy number variants that are 
likely targets of positive selection 
in Melanesians. Focusing on 
population-specific regions of 
the genome that carry dupli- 
cated genes and show an excess 
of amino acid replacements 
provides evidence for one of the 
mechanisms by which genetic 
novelty can arise and result in 
differentiation between human 
genomes. —LMZ 

Science, this issue p. 324 




NEURODEVELOPMENT 
Retinal neurons 
play musical chairs 


As the retina develops in fruit 
flies, subtypes of certain color 
photoreceptors, the R7 neurons, 
are stochastically specified even 
though the downstream neurons 
they need to connect to, the Dm8 
neurons, are not randomly speci- 
fied. Courgeon and Desplan find 
that Dm8 neurons are actually 
specified in subtypes and are 
produced in excess. Those Dm8 
neurons that connect to an R7 
input survive; those that fail to 
find a match die by apoptosis. 
The matching is facilitated by a 
pair of cell adhesion molecules. 
The stochastic results of R7 dif- 
ferentiation are thus propagated 
downstream even though those 
downstream elements are not 
stochastically determined. —PJH 
Science, this issue p. 325 


ORGANIC CHEMISTRY 
Charging through the 
looking glass 


Asymmetric catalysis is a com- 
monly applied technique to 
prepare just one of two mirror- 
image products in a chemical 
reaction. But what if you already 
have the compound you want, 
stuck in a mixture of left- and 
right-handed enantiomers? Shin 
et al. now show that light-induced 
electron transfer can trigger a 
favorable succession of proton 
and hydrogen-atom transfer 
steps, both of which are suscep- 
tible to biasing by catalysts, to 
preferentially convert a mixture 
of cyclic urea enantiomers into 
just one (see the Perspective by 
Wendlandt). —JSY 

Science, this issue p. 364; 
see also p. 304 


BIODIVERSITY 
Spatial structure of 
species change 


Biodiversity is undergoing 
rapid change driven by climate 
change and other human influ- 
ences. Blowes et al. analyze 
the global patterns in temporal 
change in biodiversity using a 
large quantity of time-series 
data from different regions (see 
the Perspective by Eriksson and 
Hillebrand). Their findings reveal 
clear spatial patterns in rich- 
ness and composition change, 
where marine taxa exhibit the 
highest rates of change. The 
marine tropics, in particular, 
emerge as hotspots of species 
richness losses. Given that 
human activities are affecting 
biodiversity in magnitudes and 
directions that differ across 
the planet, these findings 
will provide a much needed 
biogeographic understanding 
of biodiversity change that can 
help inform conservation priori- 
tization. -AMS 

Science, this issue p. 339; 
see also p. 308 


NEUROSCIENCE 
Special moments at 
cortical quiet states 


Delta waves are moments of 
widespread cortical silence that 
alternate with active states dur- 
ing slow-wave sleep. However, 
upon closer examination, single 
neuronal action potentials can 
be detected during delta waves. 
Todorova and Zugaro sought 
to determine whether this 
neuronal noise could instead 
be an important signal (see 
the Perspective by Ikegaya 
and Matsumoto). They found 
that persisting action potential 
firing during delta waves is an 
overlooked but widespread 
phenomenon, which could 
potentially involve all neurons 
and all delta waves. A critical 
role of the delta wave may be 
to insulate specific cortical 
computations taking place in 
response to hippocampal replay 
and involved in memory consoli- 
dation. —PRS 

Science, this issue p. 377; 

see also p. 306 


Published by AAAS 


TURBULENCE 
Down and down the 
energy cascade 


Injecting energy into a turbulent 
system at large length scales 
results in the energy cascading 
down and eventually dissipating 
at a characteristic small length 
scale. In conventional fluids, this 
small scale is set by fluid viscos- 
ity. Navon et al. studied the 
turbulence energy cascade in a
quantum gas, a Bose-Einstein 
condensate of rubidium-87 
atoms held in a uniform trap. 
Dissipation occurred by atoms 
escaping from the trap at a scale 
that could be tuned by varying 
the height of the trapping poten- 
tial. Thanks to the flexibility of 
their setup, the researchers were 
able to study both the steady 
state, in which energy is injected 
at the same rate it is dissipated, 
and the transient regime preced- 
ing the steady state. —JS 
Science, this issue p. 382 


WATER OXIDATION 
Inspecting S states in 
photosynthesis 


Oxygenic photosynthesis uses a
Mn₄CaO₅ cluster in the oxygen-
evolving complex to extract
electrons from water and pro-
duce dioxygen. Visualizing each
of the chemical states in this
process, S₀ to S₄, and assigning
chemical identities and mecha-
nisms on the basis of structures
has been a challenge addressed
recently by work at x-ray free-
electron lasers. Suga et al.
used serial crystallography at
cryogenic temperatures to trap
and determine the structures
of several stable states during
photosystem II water oxidation
(see the Perspective by Britt and
Marchiori). Changes around the
water cluster already happen in
the S₂ state and set the stage
for water insertion that occurs
during transition to the S₃ state.
A short 1.9-angstrom distance
between the two oxygen atoms
in the S₃ state is consistent with
theoretical studies supporting an
oxyl/oxo mechanism for oxygen-
oxygen coupling. —MAF 

Science, this issue p. 334; 

see also p. 305 


IMMUNOLOGY
Mobilizing T cells


Accumulating evidence sug- 
gests that a nonclassical 
antigen presentation pathway, 
mediated by human leukocyte 
antigen-E (HLA-E), has an 
important role in regulating 
innate and adaptive immune 
responses to infections and 
cancer. In a Perspective, 
Ottenhoff and Joosten dis- 
cuss the dichotomous role of 
HLA-E in suppressing immune 
responses (through an immune 
checkpoint), as well as in acti- 
vating unconventional T cells. 
This nonclassical pathway has 
the potential to be manipulated 
to prevent infectious disease 
through vaccination and to treat 
cancer through immunotherapy. 
—GKA 

Science, this issue p. 302 


NEURODEVELOPMENT
How GABA makes
the switch


A high concentration of chloride 
ions in neurons interferes with 
signaling mediated by the neu-
rotransmitter γ-aminobutyric
acid (GABA). The transporter 
KCC2 extrudes chloride ions, 
and Watanabe et al. and Pisella 
et al. examined mice express- 
ing KCC2 with mutations 
that mimicked constitutive 
phosphorylation at two regula- 
tory sites (see the Focus by 
Zamponi). Dephosphorylation 
of these sites in KCC2 enables 
GABA to switch from being an 
excitatory to being an inhibitory 
neurotransmitter during neuro- 
development, thus allowing the 
maturation of the neurocircuitry 
underlying cognition and respi- 
ration. —LKF 

Sci. Signal. 12, eaaw9315, eaay0300,

eaay8960 (2019). 
NEUROSCIENCE
MORE model to fight
against addiction


The United States is in the midst 
of an opioid crisis affecting 
more than 15 million Americans. 
Garland et al. provide neuro- 
physiological evidence for a 
therapeutic approach to help 
individuals with opioid use 
disorder (OUD). They conducted 
a series of randomized experi-
ments using an approach called
Mindfulness-Oriented Recovery 
Enhancement, or MORE. 
Hedonic dysregulation in brain 
reward circuitry has been viewed 
as a core mechanism of addic- 
tive behavior; the MORE model 
aims to restructure reward 
responsiveness using mindful- 
ness techniques. MORE may 
remediate hedonic dysregulation 
by simultaneously increas-
ing responsiveness to natural
rewards and decreasing reactiv-
ity to drug-related rewards.
The use of MORE and other 
cognitive training interventions 
may ultimately serve to turn the 
tide on addiction in the United 
States. —KSL 


Sci. Adv. 10.1126/ 
sciadv.aax1569 (2019). 


REVIEW SUMMARY


SIGNALING 


Sphingosine 1-phosphate: Lipid signaling 


in pathology and therapy 


Andreane Cartier and Timothy Hla* 


BACKGROUND: Sphingosine 1-phosphate (S1P),
a product of membrane sphingolipid metab-
olism, is secreted and acts through G protein-
coupled S1P receptors (S1PRs) in vertebrates.
S1PR isoforms mediate complex cellular ac-
tions either alone or in combination in most
organ systems. This stable lysolipid circulates
as a complex with protein chaperones that not
only enables aqueous solubility but also helps
facilitate specific modes of receptor signaling.
However, differential concentration gradients
of S1P are normally present in various com-
partments and are perturbed under disease
conditions. The abundance of circulatory S1P
and the high expression of S1PRs in exposed
cells (that is, vascular and hematopoietic cells)
pose a key question of how this signaling axis
is regulated. This question is of clinical rele-
vance because the first S1PR-targeted drug,
fingolimod, has been approved for the treat-
ment of multiple sclerosis since 2010. Recent
findings from basic research as well as insights
gleaned from clinical and translational studies
have enriched our understanding of how this
simple lysolipid evolved as a complex regula-
tor of multiple physiological systems and, when
dysregulated, contributes to numerous diseases.


S1PR signaling regulates multiorgan pathophysiological processes. Extracellular S1P gradients created by
transporters, chaperones (ApoM+HDL), and metabolic enzymes (LPP3) interact with S1PRs on the cell surface.
Receptor activity, transmitted by means of G proteins, is regulated by multiple mechanisms, including
β-arrestin coupling, endocytosis, and receptor modulators. The resultant cellular changes influence
multiple organ systems in physiology and disease.


Cartier et al., Science 366, 323 (2019)

ADVANCES: Extracellular spatial gradients of
S1P, demonstrated by using S1P reporters, are
tightly regulated and control fundamental pro-
cesses such as hematopoietic cell trafficking,
immune cell fate, and vascular integrity. The
gradients are formed through location-specific
function of metabolic enzymes, S1P transport-
ers, and chaperones. Such physiological S1P
gradients are altered in diseases, thus con-
tributing to conditions such as inflammation,
autoimmunity, and vascular dysfunction. S1P
complexed to chaperone proteins (for example,
high-density lipoprotein-bound apolipoprotein
M) mediates distinct modes of receptor activa-
tion, resulting in biased receptor signaling and
specific biological outcomes. S1PRs are also
regulated tightly through endocytic mecha-
nisms and receptor modulators that enhance
or inhibit signal strength and duration. Vari-
ous signaling mechanisms of this simple lyso-
lipid mediator have helped reveal its multiple
actions in the immune system, which include
adaptive immune cell localization in various
compartments (egress versus retention), fate
switching, survival, and activation that influen-
ces both cell-mediated and humoral immunity.
In the cardiovascular system, high expres-
sion of multiple S1PR isoforms in various cell
types regulates development, homeostasis, and
physiology. Current S1PR-targeted drugs that
aim to tame [text illegible] con-
siderable cardiovascular adverse events. In
the central nervous system (CNS), widespread
application of S1PR-targeted drugs in auto-
immune neuroinflammatory diseases has stim-
ulated research that revealed the broad but
poorly understood effects of S1P signaling in
neurodevelopment, the neurovascular unit,
neurons, and glia. Furthermore, in addition to
the involvement of pathological S1P signaling
in acute ischemic conditions of various organs,
chronic dysregulated S1P signaling has been
implicated in fibrotic diseases of lung, heart,
liver, and kidney.

Read the full article at http://dx.doi.org/10.1126/science.aar5551


OUTLOOK: Considerable challenges remain 
to fully harness the new knowledge in S1P 
pathobiology to translational utility in clinical 
medicine. Approaches that mimic S1P chap-
erones, S1P neutralizing agents, modulation
of transporters, biased agonists and antag-
onists of S1PR isotypes, and sphingolipid
metabolic enzyme modulators provide viable 
pathways to therapy. Focusing on the im- 
mune system, such approaches may widen 
the autoimmunity therapeutic landscape and 
provide new directions in cancer and chronic 
inflammatory diseases. For cardiovascular di- 
seases, ischemic conditions as well as chronic 
heart failure are likely candidates for fu- 
ture translational efforts. Although further 
work is needed, S1P-targeted approaches may 
also be useful in regenerative therapies for 
the aging and diseased myocardium. The 
CNS-targeted efforts may cross into neuro- 
degenerative diseases, given the success 
with S1PR-targeted drugs in reducing brain 
atrophy in multiple sclerosis. Other poten- 
tial applications include approaches in pain 
management and neurodevelopmental dis- 
orders. Such strategies, although challeng- 
ing, are greatly helped by findings from 
basic research on S1P pathobiology as well 
as pharmacological and clinical insights 
derived from the application of S1P-targeted 
therapeutics. 
Sphingosine 1-phosphate: Lipid signaling


in pathology and therapy 


Andreane Cartier and Timothy Hla* 


Sphingosine 1-phosphate (S1P), a metabolic product of cell membrane sphingolipids, is bound to 
extracellular chaperones, is enriched in circulatory fluids, and binds to G protein-coupled S1P receptors 
(S1PRs) to regulate embryonic development, postnatal organ function, and disease. S1PRs regulate 
essential processes such as adaptive immune cell trafficking, vascular development, and homeostasis. 
Moreover, S1PR signaling is a driver of multiple diseases. The past decade has witnessed an exponential 
growth in this field, in part because of multidisciplinary research focused on this lipid mediator and 
the application of S1PR-targeted drugs in clinical medicine. This has revealed fundamental principles of 
lysophospholipid mediator signaling that not only clarify the complex and wide-ranging actions of S1P
but also guide the development of therapeutics and translational directions in immunological, 
cardiovascular, neurological, inflammatory, and fibrotic diseases. 


Lipids are best known for their roles in
energy storage and formation of cellular
membranes. The diversity of membrane
lipid structures ensures structural flexi-
bility and integrity of cellular membranes
needed for adaptation to varying environ- 
ments. As vertebrate evolution ensued, metab- 
olites of membrane lipids were repurposed 
as extracellular ligands for G protein-coupled 
receptors (GPCRs), which in turn activate num- 
erous intracellular signaling pathways, thus 
coupling membrane phospholipid breakdown 
with intercellular communication (1). This cou-
pling was first illustrated by platelet activating
factor, a bioactive lipid synthesized from mem- 
brane phospholipids of hematopoietic and vas- 
cular cells that activates GPCRs in an autocrine 
and paracrine manner to evoke allergic reac-
tions and anaphylaxis (2). Similarly, sphingo-
sine 1-phosphate (S1P), which is produced and
secreted by red blood cells (RBCs), endothe-
lial cells, and platelets into the extracellular
environment, acts on specific GPCRs expressed
by most if not all cell types (3).

S1P was discovered in the 1960s as a ter-
minal product of sphingolipid metabolism (4).
Studies in cultured cells suggested its bioactive 
nature, even though the prevailing notion at 
the time considered it more akin to classical 
second messengers such as diacylglycerol and 
Ca²⁺ (5, 6). We now know that vertebrates pos-
sess five S1P receptors (S1PR1 to S1PR5) that
respond to extracellular S1P to regulate em- 
bryonic development, physiological homeosta- 
sis, and pathogenic processes in multiple organ 
systems (3). These high-affinity S1PRs couple
to key intracellular signaling pathways con-
trolled by heterotrimeric G proteins, Rho fam-
ily small guanosine triphosphatases (GTPases),
and the protein kinase AKT, resulting in 
changes in cytoskeleton, cell adhesion, and 
survival (7). Although the extracellular actions 
of S1P are well established, distinct intracel- 
lular functions of S1P have also been sug- 
gested. For example, S1P was purported to 
bind to and regulate the activities of cytosolic 
signal transduction mediator tumor necrosis 
factor-α (TNF-α) receptor-associated factor 2
(TRAF-2) (8), the chromatin-modifying enzyme 
histone deacetylase-1 (HDAC-1) (9), the mito-
chondrial regulator prohibitin-2 (10), atypical
protein kinase-C (11), and the catalytic subunit
of telomere reverse transcriptase (12). Wheth-
er intracellular S1P signaling is physiologically
relevant is not clear because genetic studies do 
not support the essential or functional roles of 
intracellular S1P targets (13-16). 

The distinct physicochemical properties of 
S1P underlie its multifunctional nature in de-
velopmental, physiological, and pathological
contexts (17). For example, resistance to oxi-
dative modification makes S1P suitable as a
modulator of inflammatory processes, which 
are characterized by the influx of cells with 
high oxidative capacity. Poor aqueous solubil-
ity of S1P necessitates its association with pro-
tein molecules for transport in the blood and
lymph circulation and access to S1PRs (18).
Such interactions also facilitate the formation 
of S1P gradients, which are spatial concentra-
tion differences across biological compartments,
thus allowing selective receptor activation at 
various locations (18-21). These properties en-
able S1P to be involved in multiple physio-
logical processes. The essential nature of S1P
functions has been illustrated in humans with 
congenital conditions such as Sjögren-Larsson
syndrome, nephrosis, adrenal insufficiency,
and hearing impairment and in mouse genetic 
studies in which major embryologic and post- 
natal defects in various organs were observed 
when S1PRs, transporters, and S1P metabolic 
enzymes were mutated (3, 22, 23). 

S1P-regulated processes contribute to several 
cardiovascular, autoimmune, inflammatory, 
neurological, oncologic, and fibrotic diseases 
(3). Moreover, the successful U.S. Food and 
Drug Administration (FDA) approval of an 
S1PR-targeted drug, fingolimod, to treat mul-
tiple sclerosis (MS) in 2010 (24) has led to
considerable growth of research and drug de- 
velopment efforts. However, fingolimod binds 
and activates four S1PRs (S1PR1, -3, -4, and -5)
while also functionally antagonizing S1PR1.
Recently, a more specific S1PR1- and S1PR5-
targeted drug, siponimod, was approved for
the treatment of MS. In this Review, we dis-
cuss how conceptual advances in understand-
ing S1P signaling have reached a critical
threshold to allow rational design of new 
therapeutic strategies. 


S1P spatial gradients 


Extracellular gradients of growth factors, cyto- 
kines, and developmental morphogens are 
critical for achieving fidelity and specificity 
of biological responses (25). Lipid mediators, 
small molecules with limited aqueous solubil- 
ity but affinity for hydrophobic molecules, 
adopted distinct signaling mechanisms to 
overcome these physicochemical challenges. 
For example, tissue- and cell-specific secretion, 
chaperone association, and extracellular metab-
olism of S1P result in the formation of S1P
spatial gradients that are important for its 
numerous biological activities. 

The enzymes that generate S1P, namely
ceramidase and sphingosine kinase (SPHK),
do double duty in intracellular sphingolipid
metabolism as well as extracellular signaling 
(26). In many tissues, S1P is rapidly degraded 
by the endoplasmic reticulum-resident S1P 
lyase, resulting in very low intracellular S1P 
concentrations. By contrast, S1P can also be 
coupled to export processes by means of spe-
cific transporters. These flip the S1P molecule
from the inner to the extracellular leaflet of 
the plasma membrane, allowing extracellular 
action of S1P (3). Once exported, S1P is cap-
able of activating S1PRs expressed in the same
cell in an autocrine manner. However, extrac-
tion by chaperone proteins, which bind to S1P
and enable aqueous solubility, leads to diffusi-
ble S1P in the extracellular environment, thus
creating spatial gradients that activate S1PRs in
a paracrine and/or endocrine manner (Fig. 1). 

Tissue-specific transporters that belong to 
the “major facilitator” superfamily export S1P 
to establish extracellular gradients. For exam- 
ple, spinster homolog 2 (SPNS2), first identi- 
fied in a zebrafish screen for cardia bifida (a 
developmental condition that leads to the for-
mation of two beating hearts), functions in the
extraembryonic yolk syncytial layer to export
S1P, which binds to S1PR2 on the cells of the
endoderm, which in turn allows directional
cardiac progenitor cell migration to the mid-
line and proper cardiac development (27). This
is the first example of S1P extracellular gra-
dients created by a lipid transporter that is
required for a developmental event (28, 29). 
In mice, SPNS2 exports S1P from vascular and 
lymphatic endothelial cells (LECs) to activate 
S1PRs on lymphocytes, thus allowing their
egress from lymph nodes into the circulatory
system (30, 31). High circulating S1P concen-
trations together with low amounts of S1P in
interstitial fluid in secondary lymphoid organs,
such as lymph nodes, are critical for lympho-
cyte egress into lymphatic circulation (19).
Mouse models that use reporters to sense
extracellular S1P revealed S1P gradients at
higher resolution. In the spleen, extracellu- 
lar S1P gradients formed by the phosphatase 
enzyme lipid phosphate phosphohydrolase 3 
(LPP3) are essential for marginal zone B cell 
trafficking (32), whereas LEC-expressed SPNS2 
in lymph nodes regulated medullary cord nat- 
ural killer (NK) cell location and function (33). 
The transporter major facilitator superfamily 
domain-containing protein 2B (MFSD2B), 
which exports S1P from RBCs into plasma, is 
also important for their normal function and 
turnover (34). Loss of function of S1P trans- 
porters leads not only to developmental defects 
but also to early onset hearing loss (35) and 
retinal defects (36). These observations sug-
gest that spatial S1P signaling, which is critical
in embryogenesis and postnatal physiological 
processes, leads to pathological conditions 
when dysregulated. Cellular mechanisms by 
which S1P transporters are controlled are poor-
ly understood. Strategies to restore normal
transporter function could be useful in the
treatment of various diseases in which extra-
cellular S1P gradients are perturbed.


Chaperones, biased signaling, 
and S1PR regulators 


S1P chaperone proteins, which stably bind and 
transport S1P in circulatory and interstitial fluids,
facilitate S1PR activation on recipient cells. The
prototypical S1P chaperone is apolipoprotein 
M (ApoM), a component of high-density lipo- 
protein (HDL), a circulating lipoprotein asso- 
ciated with vascular health (20). In addition, 
serum albumin binds to S1P at lower affinity 
(37). HDL-bound ApoM binds S1P stably in 
blood plasma and activates S1PRs on endo- 
thelial cells to suppress inflammation, pro- 
mote barrier function, allow liver regeneration 
after hepatectomy, and protect from bacterial 
endotoxin-induced inflammatory responses 
(38-41). In hematopoietic progenitor cells, 
ApoM-bound S1P suppresses lymphopoiesis
(42). HDL-bound S1P activates S1PR1 as a
biased agonist that selectively induces certain 
biological responses. Other HDL-bound factors
(such as apolipoprotein A1, cellular HDL recep-
tor scavenger receptor-B1, and the ability of
HDL to modulate membrane raft-based orga-
nization of receptor-effector complexes) may
all be involved in biased signaling (43). Ad-
ditionally, the relative stability of HDL-bound
S1P compared with albumin-bound S1P may
also play a role in differential kinetics of re-
ceptor activation and signaling (Fig. 2) (38). By
contrast, albumin-bound S1P is short-lived in
plasma (44) and activates S1PRs in a manner
distinct from that of HDL-S1P. 

Once activated by small-molecule agonists 
or albumin-bound S1P, S1PR expression on the
cell surface is down-regulated, which makes 
vascular and hematopoietic cells exposed to 
circulatory S1P refractory to further stimula-
tion. Regulation of S1PR1 endocytosis is pre-
cisely controlled by GPCR kinase 2, dynamin,
and moesin (45-48). Loss of such S1PR1 regu-
latory events leads to changes in biological out-
comes such as altered immune cell trafficking
(46) and vascular endothelial barrier function
(49). Sustained endocytosis of lymphocyte
S1PR1 explains the therapeutic efficacy of S1PR1
functional antagonists in the treatment of auto-
immune diseases.

Fig. 1. Establishing and maintaining S1P gradients in vessels. S1P is transported by chaperones
ApoM+HDL and albumin in the circulation and presented to S1PRs. S1P export by vascular endothelial cells
(VECs) and lymphatic endothelial cells (LECs) (by SPNS2) and by RBCs (by MFSD2B) as well as degradation
of S1P by the phosphatase LPP3 modify the extracellular S1P gradient. Low interstitial tissue S1P is achieved
by the S1P lyase. Tissue concentrations of S1P are indicated in parentheses. S1PR, sphingosine 1-phosphate
receptor; SPNS2, sphingolipid transporter 2; MFSD2B, major facilitator superfamily domain containing 2B;
LPP3, lipid phosphate phosphatase 3.

The recent development of S1PR reporter
mice has revealed plasma S1P-dependent β-
arrestin activation in vascular endothelium
upon systemic endotoxemia (50, 51) and in
the aortic endothelium at areas of disturbed
flow (38). This suggests that S1PR1 is down-
regulated in injured and inflamed endothelium.
The ability to monitor S1PR function in vivo in
real time and at single-cell resolution is anti-
cipated to lead to new insights into how S1P
signaling is regulated physiologically and in 
disease. 

In addition to the natural ligand S1P, other
molecules as well as coreceptors modulate
S1PR activity. For example, conjugated bile
acids (CBAs) activate S1PR2, which may be im-
portant in hepatobiliary and intestinal systems
during cholestatic injury (52). In endothelial 
cells, activated protein C receptor and the 
hyaluronic acid receptor CD44 transactivate 
S1PR1 to increase barrier function and thereby 
prevent plasma leakage, thus limiting inflam- 
matory responses (53, 54). In lymphocytes, 
activation-induced CD69 binds to S1PR1 and
down-regulates its expression, an event that 
is critical for regulation of lymphocyte egress 
and tissue residence (55). In the central nervous 
system (CNS), the binding of the extracellular 
domain of NOGO-A, a neuronal repulsive mol- 
ecule that regulates neural network formation, 
to S1PR2 induces signals by means of the
GPCR Gα13-RhoA signaling pathway to repress
neurite outgrowth, thus achieving synaptic 
plasticity in the hippocampus and motor cor-
tex (56). An unbiased, genome-wide CRISPR/
Cas9 screen for S1PR1 modulators identified
lysophosphatidic acid receptor-1 as a negative 
regulator. This axis is important in lymphatic 
endothelial cells of the lymph node sinuses 
(57). In γδ T cells, S1PR2 inhibition of S1PR1
function restrains them in the skin, thus pro- 
moting immunity in the local environment by 
gaining tissue residency (58). 

These examples illustrate that S1PRs evolved
to interact with many other signaling pathways 
involved in embryonic development, inflam- 
mation, host defense, and homeostasis. Mech- 
anistic details of how S1P transfer takes place 
from the chaperones to S1PRs, the action of 
various receptor modulatory molecules, and 
the functions of regulated endocytosis and 
plasma membrane retention of S1PRs need
to be further elucidated. Interestingly, the crys-
tal structure of agonist-bound S1PR1 suggested
that ligand (S1P) entry takes place by means of
the transmembrane domains of S1PR1 (59).
Therefore, it is likely that HDL docking on the
plasma membrane enables efficient S1P trans-
fer to S1PRs by way of a transmembrane route.
New structural analysis methods, such as cryo-
electron microscopy, may reveal dynamic re-
ceptor conformations that are anticipated to
lead to insights for the development of highly
specific receptor modulatory agents. This may
also lead to general understanding of how lipid
mediators function in regulating cellular and
organ function.

Fig. 2. S1PR1 signaling regulation. Schematic representation of regulators of S1PR1 cell surface expression,
signaling, and turnover. Albumin-S1P, HDL-S1P, and FTY720-P activate S1PR1. Internalization of S1PR1
eventually leads to proteasomal degradation by the ubiquitin ligase WW domain containing E3 ubiquitin
protein ligase 2 (WWP2). CD44 and activated protein C (APC)/endothelial protein C receptor (EPCR)
transactivate S1PR1, while CD69 binds to and down-regulates S1PR1 at the cell surface. Endocytosis of S1PR1
is achieved through GPCR kinase 2 (GRK2)-dependent phosphorylation and endocytic regulators (dynamin,
moesin, β-arrestin, and AP2/Clathrin). The intracellular signaling of S1PR1 is transmitted by means of the
heterotrimeric Gαi family proteins, the inhibition of adenylyl cyclase, and activation of phosphatidylinositol
3-kinase (PI3K)-AKT and ERK pathways. Palmitoylation by DHHC5 affects the coupling of S1PR1 to Gαi,
whereas α-synuclein modulates the coupling of S1PR1 with Gαi by removing S1PR1 from lipid rafts.
Regulators of other S1PRs are not well characterized.


Cardiovascular development 
and pathophysiology 


The vasculature, one of the first organ systems 
to develop in vertebrate embryogenesis, is reg- 
ulated by circulatory S1P signaling (Fig. 3) 
(60, 61). Initial phases of angiogenesis encom- 
pass hypoxia-induction of vascular endothelial 
growth factor (VEGF) that induces endothelial 
sprouts, in which S1P signaling is minimal. 
When vascular sprouts fuse to form primary 
vascular networks, circulatory S1P accesses en-
dothelial cells. These fusion events coincide
with the induction of S1PR1 expression in the
endothelium, and consequently, the formation 
of adherens junctions and cell-extracellular 
matrix adhesions that enable the vascular bar- 
rier to form (62, 63). Activation of endothelial 
nitric oxide synthase (eNOS) dilates vessels and 
allows proper blood flow (61). Multiple redun-
dant sources of circulating S1P from RBCs, en-
dothelial cells, and platelets and the existence
of sensing mechanisms to monitor and main-
tain circulating S1P levels during vascular de-
velopment reinforce the essential nature of
spatial S1P gradients (64, 65). 

Postnatally, S1P signaling regulates normal
vascular homeostasis. For example, S1P-S1PR1
signaling maintains endothelial cell barrier func- 
tions to limit plasma and leukocyte extravasa- 
tion (66, 67). Deficiency in this homeostatic 
signaling system, such as reduced HDL-S1P,
leads to compromised barrier function and 
vascular tone, which is needed to control blood 
flow and systemic blood pressure (39, 68). In 
addition, S1P signaling in the endothelium pro-
tects the glycocalyx, a protective, glycoprotein-
rich layer that lines the apical surface of blood 
vessels (69). Recent identification of the endo- 
plasmic reticulum-resident protein, NOGO-B, 
as a negative regulator of de novo sphingolipid 
synthesis and autocrine S1P signaling in the 
endothelium suggests the presence of local, as 
well as systemic, factors in the regulation of 
vascular tone and blood pressure (70, 71). The 
control of local S1P signaling may be vascular 
bed-specific—that is, in arteries versus veins— 
and possibly even organ-specific to enable pre- 
cise control of the vascular system in response 
to metabolic and physiological cues of a spe- 
cific organ system (72). 

Diabetes and hypertension as well as pre- 
mature birth frequently lead to vascular dis- 
eases of the eye (retina), such as retinopathy, 
which is characterized by abnormal blood flow, 
excessive vascular leak, and blood vessel over-
growth. Neutralizing S1P antibodies reduced
retinal neovascularization, and S1pr2-deficient
mice showed attenuated angiogenesis in a 
mouse model of retinopathy of prematurity, a 
condition that occurs in premature neonates 
exposed to high levels of oxygen (73, 74). In 
diabetes as well as retinopathy of prematurity, 
the motility of Müller glial cells, which moni-
tor retinal homeostasis, support neurons, and
contribute to retinal structure and function,
is regulated by S1P activation of S1PR3, sug-
gesting a role for pathologic S1P signaling in
retinal astrogliosis (scar formation) (75). S1P
signaling may be a useful therapeutic target 
for proliferative retina diseases, such as diabe- 
tic retinopathy and age-related macular dege- 
neration (wet form). 

Cardiac development is also regulated by 
S1PR signaling. In zebrafish, the SPHK2-SPNS2-
S1PR2-hippo signaling pathway is essential for
endoderm development and myocardial pro-
genitor cell migration (14, 28, 76). In mice, S1PR1
function in the endocardium, cardiomyocytes, 
and endothelial cells is essential for embryonic 
heart development (77). In human induced 
pluripotent stem cells, S1P induces cardiomyo- 
cyte differentiation (78), suggesting a critical 
role of S1P in development of the myocardium
(heart muscle). Although S1P pathway al- 
terations have not been causally linked with 
congenital heart defects in humans, these ob-
servations suggest that S1P is involved in post-
natal cardiac remodeling events, including
heart failure, a complex disease in which car-
diac tissue is remodeled and the pumping ef-
ficiency of the heart diminishes.

Fig. 3. S1P function in vascular development, homeostasis, and pathology. Vascular network development,
maturation, and stabilization are regulated by endothelial S1PR1. Activation of S1PR1 inhibits sprouting
angiogenesis and vascular endothelial growth factor receptor 2 (VEGFR2) and promotes vascular stability
through adherens junctions and cell-extracellular matrix adhesion (integrins). Loss of cell-surface S1PR1
and destabilization of the endothelium cause the leakage of plasma and immune cells into the tissue,
leading to inflammation. Vascular damage also leads to thrombosis (platelet aggregation) and fibrin
deposition in the interstitial space, which activates fibroblasts and macrophages. Loss of S1PR1 signaling
on endothelial cells and activated S1PR2 and S1PR3 signaling on macrophages and fibroblasts ultimately
lead to tissue fibrosis, with extracellular matrix deposition, and organ dysfunction.

Postnatally, cardiac S1PRs regulate several 
pathophysiological processes such as cardiac 
rhythm, vascular perfusion, and myocardial 
protection. S1P induces bradycardia (slow
heart rate) by activating the S1PR1-regulated
G protein-coupled inwardly-rectifying potas-
sium (GIRK) channels, which hyperpolarize
myocardial cells and reduce the rate of action
potential firing (79). HDL-induced S1P signal-
ing by means of endothelial S1PRs allows
efficient coronary circulation through eNOS-
dependent vasodilatation and protection from
myocardial ischemia-reperfusion injury, which
is an important clinical problem after heart
attacks (80). Importantly, the S1P pathway is
cardioprotective. During ischemic precondi-
tioning, a phenomenon in which small and
transient ischemia protects from massive ische-
mic insults, SPHK is induced in the myocar-
dium, resulting in enhanced synthesis of S1P,
which protects the heart by promoting circu-
lation and reducing the heart rate (81). Further,
in a porcine model of ischemia-reperfusion
injury, activation of S1PRs by fingolimod re-
duced infarct size and improved cardiac func-
tion (82). Moreover, in a mouse model of acute
myocardial infarction, treatment with an engi- 
neered S1P chaperone (ApoM-Fc) suppressed 
cardiac injury and enhanced the recovery of 
cardiac function (67). 

An endogenous regulator of S1P metabolism,
NOGO-B, was shown to modulate plasma and
inflammatory cell extravasation from blood ves-
sels in pressure-overloaded hearts and patho-
logical cardiac hypertrophy through endothelial
S1PR1 autocrine signaling (83). The complex
developmental and pathophysiological roles
of S1P in the cardiovascular system provide
therapeutic opportunities. For example, endo-
thelial cell-targeted S1PR1 agonism may be
useful in limiting tissue damage during ischemia-
reperfusion injury. In addition, down-regulating
S1PRs in autoimmune diseases leads to adverse
cardiovascular effects, such as bradycardia and
vascular leakage, which were predicted by pre-
clinical and mechanistic studies (84). Therefore,
S1PR-targeted drugs that are more selective
for cardiac, vascular, or immune S1PRs
may provide improved safety profiles in the
control of cardiovascular and autoimmune
disorders, respectively.


Complex actions in immunity 


In addition to its established roles in immune 
cell trafficking, S1P signaling regulates diverse 
immunological processes, including cell fate 
switching, cell survival, innate immunity, and 
antitumor immune responses (Fig. 4). 

Recent studies have illustrated the role of 
S1P gradients and S1PR signaling in the migra-
tion and retention of adaptive immune cells in
resident tissues, thus influencing the specific-
ity and magnitude of immune responses. En-
dothelial cell SPNS2, by means of the export of
S1P, mediates the egress of immature B cells
from the bone marrow and naive T cells from
the thymus (30, 37). In the spleen, mature B cell
localization and shuttling at the specific im-
munological region of this organ, called the
marginal zone (MZ), are critical for the adaptive
immune response against blood-borne anti-
gens that get trapped in the spleen (85). B cell
S1PR1 ensures MZ localization, whereas S1PR3
inhibits MZ shuttling, both of which are im- 
portant for optimal immune responses (86). 

B cell confinement and growth regulation 
in the germinal centers of lymphoid follicles, 
which are sites for B cell maturation and clo-
nal selection, are controlled by S1PR2 signaling
through the Gα13 GPCR signaling pathway
(87). In the absence of this spatial restriction,
B cell mislocalization is observed and may
contribute to the dissemination of lymphoma
(a B cell tumor). A recent study showed that
in glioblastoma brain tumor patients, naive
T cell S1PR1 is down-regulated, leading to se-
questration of such cells in bone marrow,
resulting in lymphopenia and decreased anti-
tumor immune responses (88). Prevention of
this tumor-induced S1PR1 loss of function in
T cells could enhance antitumor immunothe-
rapy treatments. These illustrative examples
demonstrate how spatial S1P gradients and
cell-specific S1PR isotypes shape adaptive im-
mune responses.

CD8+ resident memory T cells (CD8+ TRM
cells)—long-lived, noncirculating cells that
are critical for immunological memory and
host defense—show reduced expression of the
transcription factor Krüppel-like factor 2 (KLF2)
and its downstream target gene S1pr1 in mice,
thus effectively minimizing egress and achieving
tissue retention (89). In addition, the cell-surface
lectin CD69 down-regulates S1PR1 in T cells,
which also blocks tissue egress and reinforces
TRM cell differentiation. CD69 expression is also
critical for tissue retention of not only TRM cells
but also early effector T cells, thus enabling
rapid immunological memory responses and
influencing the magnitude of the adaptive immune re-
sponse (90). The migration of human effector
memory T cells (which are circulating TEM cells)
and recently activated T cells is inhibited by
S1P because of the resulting down-regulation
of S1PR1 and enhanced expression of S1PR2
on these cells (91). Thus, S1PR-dependent sig-
naling in various memory T cells shapes long-
term immunological memory, a key defense
mechanism against repeated infections.

Fig. 4. S1P function in immunity. Immune cell function, egress, and survival are highly dependent on the regulation of S1P and S1PR signaling. Innate immune cell localization to inflammatory sites and egress of B and T cells from bone marrow and thymus, toward and out of secondary lymphoid organ marginal zones, rely on S1PR cell surface expression and the S1P gradient. The metabolism of T cell ATP as well as the regulation of fate switching between TH17 and regulatory T cells are also S1PR1-dependent. The confinement of activated CD4+ T cells in germinal centers within secondary lymphoid organs and spleen is regulated by S1PR2, whereas SPNS2 and S1PR1 are shown to be implicated in the recruitment of effector T cells to the tumor microenvironment and suppression of metastasis.

In addition, innate immune cell localization 
and function, which are important for host de-
fense and inflammatory processes, are regu-
lated by S1PR signaling. The positioning of NK
cells in the medulla of lymph nodes, which is
important for rapid production of the in-
flammatory cytokine interferon-γ (IFN-γ) after
infection, is regulated by S1PR5 (33). S1PR5
expression is induced during NK cell matura-
tion and regulates the egress of NK cells from
the bone marrow and secondary lymphoid
organs (92). In plasmacytoid dendritic cells,
S1PR1 blunts the amplification of inflamma-
tory IFN-α-dependent downstream immune re-
sponses (93). Intestinal type-2 innate lymphoid
cells undergo interorgan migration, a property
that is essential to their protective role in para-
sitic infections, in an S1P-dependent manner
(94). S1PR2 promotes the retention of macro-
phages at sites of inflammation by inhibiting
chemokine-induced cellular motility (95). The
generality of S1PR-dependent immune cell
trafficking and retention mechanisms in in-
fluencing pathological processes was recently
illustrated by the discovery that the S1P trans-
porter SPNS2 is essential for the recruitment
of antitumor CD8+ T cells and NK cells to the
tumor microenvironment, resulting in the
inhibition of metastasis (96).

Moreover, S1PRs also modulate immune cell
energy metabolism, which influences cell fate
during immune responses. Naive T cells con-
stantly survey lymphoid organs for foreign
antigens, which necessitates high metabolic
flux to generate intracellular adenosine tri-
phosphate (ATP). In LECs, SPNS2 exports S1P,
which activates lymphocyte S1PR1 and helps
maintain the mitochondrial content of naive
T cells, thus ensuring high ATP generation by
mitochondria (97). S1P is also necessary for the
persistent activation of CD4+ T cells in inflamed
tissues (98).

S1P signaling in specific contexts appears to
regulate T cell subtype fate switching, which
determines divergent immune responses. S1PR1
phosphorylation and subsequent internalization
in T cells regulate T helper 17 (TH17) cell polar-
ization, which induces pro-inflammatory im-
mune reactions. Specifically, S1PR1 on the
plasma membrane of immature T cells pro-
motes differentiation into TH17 cells by means
of interleukin-6 (IL-6)-dependent signal trans-
ducer and activator of transcription 3 (STAT3)
signaling and enhances autoimmune neuro-
inflammation (99). S1pr1-deficient mice and
MS patients treated with fingolimod (which
down-regulates S1PR1) exhibit increased levels
of regulatory T cells (Treg cells), which suppress
inflammatory responses (100). Thus, S1PR1
regulates proper TH17 and Treg cell distribu-
tion across peripheral organs and thereby in-
fluences the magnitude of adaptive immune
responses. Permanent deletion of S1pr1 in
Treg cells leads to spontaneous autoimmunity,
whereas acute deletion of S1pr1 induces a
higher inflammatory response in the experi-
mental autoimmune encephalomyelitis (EAE)
model, an animal model for MS in which
adaptive immune cells enter the CNS to induce
myelin destruction (101). A key unexplored
question is how altered signaling of S1PR1
translates to T cell fate switching. It is pos-
sible that altered T cell localization brought
about by S1PR1-dependent trafficking and re-
tention mechanisms might change the mag-
nitude of cytokine responses that induce fate
switching. Alternatively, S1PR signaling may
influence T cell-intrinsic mechanisms that in-
duce fate switching—forkhead box P3 (FOXP3)
for Treg cells and retinoid-related orphan
receptor-γ (RORγ) and STAT3 for TH17 cells.
However, modulation of expression of the
transcription factor STAT3 may be involved in
S1PR1-dependent pathological immune reac-
tions (102, 103). Although many mechanistic
steps have been uncovered, much needs to be
learned about receptor specificity, redundant
signaling, and involvement in normal and
pathological immune responses. This could
allow rational design of immunological and
anti-inflammatory therapeutics that target
S1P signaling.


The central nervous system 


The neurovascular unit of the CNS forms a 
selective barrier called the blood-brain barrier 
(BBB), which is composed of endothelial cells, 
pericytes, astrocytes, and microglia and pro- 
tects the brain parenchyma from circulatory 
elements. Dysregulated S1P signaling disrupts 
the BBB, which is an early event that contrib- 
utes to many CNS diseases, including MS, 
Alzheimer’s disease (AD), and ischemic stroke 
(Fig. 5) (104). Brain endothelial cell S1PR1 reg-
ulates tight junction protein functions, thus
leading to the establishment of the BBB. Studies
in mice revealed that loss of this mechanism
resulted in a size-selective [molecular weight <
3 kD] breach of the BBB (105), presumably
because of defective tight junction function
and increased leakage by way of a paracellu-
lar pathway (that is, between cells). Because
transient pharmacological inhibition of brain
endothelial cell S1PR1 leads to increased CNS
penetration of small molecules, this approach 
may potentially be useful in enhancing drug 
delivery in neurological diseases (105). In ad- 
dition, P glycoprotein, which exports small 
molecules out of the CNS into the circulation, 
is inhibited by S1PR inhibitors, which allows 
enhanced penetration of small-molecule drugs 
into the CNS (106). Such preclinical studies 
have sparked interest in inhibiting S1PR1 to
enhance CNS-targeted therapeutics, a major 
bottleneck in modern medicine. 

In acute ischemic stroke, SPHK2-dependent 
S1P production in the brain tissue is neuro-
protective, presumably because of extracellu-
lar signaling by S1PRs (107). By contrast, S1PR2,
which is induced in vascular endothelial cells
after ischemia, was associated with increased
infarct size, edema, cerebrovascular permeabil-
ity, hemorrhage, and neurovascular injury
(108). Fingolimod was effective in rat models
of intracerebral hemorrhage and stroke through
immunological and nonimmunological mech-
anisms (109). These preclinical studies have
prompted the initiation of clinical trials to use
fingolimod in the treatment of stroke (110).
These studies also highlight the S1P axis in
protecting the vasculature during ischemic 
stress and hence the underlying neuronal tis- 
sue after stroke. 

The expression of S1PR2 was higher in vas-
cular endothelium and astrocytes within neu-
roinflammatory disease-susceptible CNS regions
of an inbred strain (SJL) of female mice and in
the white matter of female MS patients than in
those of males. In mouse EAE models, S1PR2
function is responsible for the breakdown of ad-
herens junctions, BBB leakage, and chemokine-
dependent inflammatory responses, thus
contributing to disease severity (111). Given
that autoimmune neuroinflammatory diseases,
such as MS, occur more commonly in females,
the identification of S1PR2 as a sex-dimorphic
regulator of BBB permeability raises a
number of interesting questions. For example,
what are the mechanisms by which blood ves-
sels in the brain of females express higher
levels of S1PR2? Would targeting this recep-
tor provide a therapeutic opportunity for the
treatment of MS, especially in treatment- 
resistant female MS patients? 

In MS, autoreactive immune cells enter the 
CNS and induce inflammation, leading to de-
myelination. In addition to S1PR1 activation in
autoreactive lymphocytes, which is an impor-
tant target of S1P therapeutics, S1PR1 and S1PR3
expressed by astrocytes may influence neuro-
inflammatory processes (112, 113). Recent studies
suggest that SPHK1 and S1PR3 are associated
with inflammatory phenotypes of glia (114, 115).
More defined mouse genetic studies are needed
to precisely elucidate S1PR-mediated inflam-
matory processes in both immune cells and
resident microglial cells of the CNS in various 
forms of MS. This is an important issue to 
resolve for the development of more effective 
therapeutics. 

Recent studies also suggest the involvement 
of S1PR signaling in the pathogenesis of AD, 
Parkinson’s disease (PD), and Huntington’s 
disease (HD). Associations of decreased
S1P levels with AD (116, 117), PD (118), and HD
(119) were observed, especially in early stages
of these diseases. In addition, defective S1PR
signaling was seen in cell culture (120) and
preclinical models of neurodegenerative dis-
ease (121). Given the high unmet medical need
for these conditions, the prominent signaling
of S1PRs in the CNS suggests that better un-
derstanding of the exact roles of S1P signaling
in AD, PD, and HD could lead to new thera- 
peutic approaches. 


The pathogenesis of fibrosis 


Chronic tissue damage coupled with reduced 
resolution of inflammatory and wound heal-
ing responses leads to fibrosis (scarring), thus
impairing normal organ function. Abnormal
S1P signaling plays an important role in the
fibrosis of multiple organ systems (Fig. 3).
When the microvasculature is injured or dys-
functional, tissue parenchyma becomes flooded
with plasma S1P, which leads to the activation
of S1PRs in interstitial cells. In addition, platelet
activation and thrombosis also cause increased
local secretion of S1P, which leads to abnormal
S1PR activation. Attenuated vascular S1PR1 and
increased parenchymal and immune cell S1PR2
and S1PR3 lead to activation of Rho GTPase
and Hippo-Yes-associated protein (YAP) sig-
naling pathways that drive fibrosis (76). Thus,
pathological S1P signaling in parenchymal and
infiltrating immune cells promotes fibrotic
diseases of the kidney, liver, and lung.

Acute kidney injury caused by periopera-
tive renal ischemia and reperfusion leads to
damage and dysfunction of the tissue, result-
ing in up-regulation of S1PR2 expression in
renal proximal tubule cells, which promotes
kidney fibrosis. By contrast, SPHK1 and S1PR1
signaling attenuated renal injury, presumably
by salvaging the vasculature (122). Treatment
of mice with fingolimod decreased the infil-
tration of immune cells and fibrosis in the
kidney after unilateral ureteral obstruction
(UUO), a model of kidney injury, likely be- 
cause of transient activation of vascular S1PR1 
and inhibition of immune cell trafficking to 
the kidney (123). In tubulointerstitial renal fibro-
sis induced in mice, SPHK2 is up-regulated.
Sphk2-deficient mice show less severe fibrosis
in the UUO model, and this correlates with
suppression of transforming growth factor-β
(TGF-β) signaling in the kidney parenchymal
cells and increased macrophage polarization
toward pro-healing and anti-inflammatory (M2)
macrophages (124, 125). Further, genetic dele-
tion of Sphk2 in bone marrow in mice is suf-
ficient to protect from folic acid-induced renal
fibrosis through the signaling of IFN-γ, CXC
ligand 9 (CXCL9), and CXCL10 (126) on infil-
trated hematopoietic cells as well as kidney
parenchymal cells. Because kidney injury and
fibrosis occur in many clinical conditions—
including diabetes, hypertension, autoimmune
diseases, and infections—the identification of
vascular S1PR signaling activation as a renal
protective mechanism has revealed numer-
ous therapeutic opportunities such as S1PR1-
selective agonists and SPHK inhibitors.

Fig. 5. S1P in the CNS. S1PRs are important for normal CNS development, neural stem cell self-renewal and differentiation, as well as neural tube closure and the formation of the neurovascular unit (NVU). Dysregulation of S1PRs is also implicated in stroke, subarachnoid hemorrhage (SAH), multiple sclerosis, and neurodegenerative diseases such as Parkinson’s disease (PD), Alzheimer’s disease (AD), and Huntington’s disease (HD).

S1P pathway dysregulation is also involved
in liver fibrosis, a serious chronic disease that 
is associated with fatty liver disease, hepatitis, 
and alcoholism. In human tissues, SPHK1, SPNS2, 
and S1PR2 expression correlate with the sever-
ity of liver fibrosis (127). Increased liver S1P
levels activate hepatic stellate cells, which 
are located between sinusoidal vessels and hepa- 
tocytes, to undergo fibrotic changes (127, 128), 
and inhibition of S1PR2 with a pharmacolo- 
gical antagonist attenuated liver fibrosis in mice 
(129). S1P signaling in bone marrow-derived 
macrophages recruited to the injured liver in 
mice may be involved in the amplification of 
fibrotic responses (130, 131). Liver regenera-
tion in mice after partial hepatectomy or bile
duct ligation is dependent on activation of S1PR1
expressed by liver sinusoidal endothelial cells
(LSECs) by ApoM+ HDL-bound S1P and on sup-
pression of Rho signaling. Hepatic sinusoidal
vascular remodeling and the restoration of
functional liver tissue are impaired in Apom-
deficient mice, which showed characteristics
of perivascular fibrosis and thrombosis. Con-
versely, ApoM transgenic mice showed minimal
fibrosis by way of endothelial S1PR1 function in
liver sinusoids (40). 

Idiopathic pulmonary fibrosis (IPF), a pro- 
gressive lung disease that occurs in ~0.1% of 
the population worldwide, leads to disability 
and death. The S1P pathway is implicated in 
normal function of the pulmonary vasculature 
and appears to be impaired in acute lung dys- 
function while it is ectopically induced in chro- 
nic fibrosis. Patients with IPF have increased 
S1P levels in serum and bronchoalveolar lav-
age (BAL) and increased SPHK1 expression in
lung alveolar macrophages, which correlated
with markers of epithelial mesenchymal tran-
sition (EMT) and fibrosis such as α-smooth
muscle actin (α-SMA), vimentin, and collagen
type 1. In alveolar type II cells in the lung, S1P
activates S1PR2 and S1PR3 and induces EMT
through the activation of phosphorylated-SMAD3,
RhoA-GTP, oxidative stress, and TGF-β1 release
(132). The use of receptor-selective agonists
revealed that although endothelial cell S1PR1
protects from fibrosis and promotes normal
lung function, S1PR2 and S1PR3 expressed
in fibroblasts and infiltrating myeloid cells
promote fibrotic changes, such as EMT marker ex-
pression, increased proliferation of myofibro-
blasts, and extracellular matrix deposition
(133). The fibrosis-inducing potential of the
DNA-damaging agent bleomycin in mouse
models was attenuated in S1pr2- and S1pr3-
deficient mice (134, 135). In these preclinical
models, collaboration of inflammatory cells
and lung parenchymal cells occurs in the com-
plex S1P signaling network to drive fibrotic
responses. These observations suggest ago-
nism of vascular endothelial S1PR1 and antag-
onism of parenchymal S1PR2 and/or S1PR3 as
potential therapeutic approaches in fibrotic 
diseases of multiple organ systems. Whether 
the results from mouse models will be relevant 
to human fibrotic disease is not clear and 
requires large-animal preclinical models and 
ultimately clinical trials with appropriate S1PR-
selective agents.


Therapeutic opportunities 


Fingolimod, the first FDA-approved S1PR- 
targeted drug, is an effective first-line thera- 
py in relapsing remitting multiple sclerosis 
(RRMS). The primary mechanism of action is 
as a functional antagonist of S1PR1 expressed
on lymphocytes (24), preventing the egress of
autoreactive lymphocytes from secondary
lymphoid organs and reducing their migration
into the CNS, thus slowing disease progres-
sion. Substantial adverse events such as first-
dose bradycardia (reduction of heart rate)
and macular edema (vascular leak into the
center of the retina) limit the use of fin-
golimod (84). The adverse events can be ex-
plained by on-target actions on S1PRs expressed
by the heart and retinal vasculature.

Currently, several clinical trials are un-
der way to test the effectiveness of the next-
generation S1PR-targeted agents (Table 1)
(136). Such agents also induce functional antag-
onism of S1PR1 in lymphocytes and are more
selective in their interaction with other S1PRs.
One such compound, siponimod, a selective
antagonist of S1PR1 and S1PR5, was shown to
have efficacy in both RRMS and progressive
forms of MS in human patients but had an
adverse event profile similar to that of fingolimod (137),
which confirms that interference with S1PR1 
in the heart and vasculature is the cause of 
these side effects. Siponimod achieved FDA 
approval in 2019. Siponimod interaction with 
S1PR5 is thought to enhance survival and/or
differentiation of mouse oligodendrocytes, 
which are defective in MS and neurodegene- 
rative diseases (138). Subsequent studies of 
fingolimod as well as the recent phase III 
studies with siponimod (137) found that S1PR
functional antagonist therapy of MS patients
protects from accelerated brain atrophy. This 
finding warrants further studies to examine 
whether this therapeutic axis is useful in neu- 
rodegenerative diseases in general. Further- 
more, because S1PR1-dependent autoreactive 
immune cell trafficking to target organs is 
common to many autoimmune diseases, this 
therapeutic approach is likely applicable to 
other non-CNS autoimmune diseases. As such, 
clinical trials are under way to treat patients 
with psoriasis, inflammatory bowel disease, 
and systemic lupus erythematosus (Table 1).

Biased agonists of S1PR1 may achieve vascu-
lar protective effects in diabetes and metabolic
and cardiovascular diseases (38). Activation
by S1P of S1PR1 in the endothelium leads to
receptor down-regulation (endocytosis), which 
limits the response to agonists. However, 
biased agonists of endothelial S1PR1 that do 
not induce S1PR1 endocytosis—and thus, re- 
ceptor degradation—may induce sustained 
endothelial protective effects such as NO syn- 
thesis, increased barrier function, and en- 
dothelial cell survival, among others, thus 
protecting the vasculature. However, it is pos- 
sible that such agents could have distinct 
adverse effect profiles, such as hypotension. 
Biased agonists for other GPCRs such as 
μ-opioid receptor, dopamine D2 receptor, and
angiotensin receptor-1 are being actively de-
veloped to achieve clinical efficacy and reduced
adverse effect profiles (139). In some cases, such
as that of the μ-opioid receptor, a biased
agonist potently achieved analgesia with lim- 
ited adverse effects (140). 

Small-molecule pharmacological antagonists 
have been used to target other S1PRs in pre-
clinical models of inflammation and fibrosis
(129, 131, 135, 141). This approach may poten-
tially lead to adverse events such as hearing
defects (35), lymphoid neoplasms (87), and
epileptic seizures (142), which have been ob-
served in S1pr2 genetic loss-of-function mouse
models. S1PR redundancy may also lead to
limited efficacy in targeting a single receptor 
isoform, and therefore, targeting multiple re- 
ceptor isoforms that carry out similar bio- 
logical effects may prove to be more effective. 

S1P chaperones offer an additional oppor-
tunity for S1PR modulation. For example, ther-
apeutic delivery of engineered ApoM—namely,
ApoM-Fc—suppresses inflammation (39, 47)
and promotes vascular homeostasis in mouse
models of myocardial ischemia-reperfusion in-
jury, chronic hypertension, and experimental
stroke (67). In sharp contrast to small-molecule
S1PR1-targeted drugs, ApoM-S1P treatment
did not lead to lymphopenia, suggesting the
inability of chaperone-bound S1P to down-
regulate S1PR1 in secondary lymphoid organs.
This ApoM-S1P therapeutic approach may se-
lectively activate endothelial cell S1PRs with-
out inducing functional antagonism of im-
mune cell S1PRs. However, ApoM-S1P can ac-
tivate multiple receptors and therefore could
potentially lead to undesirable side effects
by activating pro-inflammatory S1PR2 and
S1PR3. Selective activation of S1PRs by modi-
fied chaperone-based approaches may need to
be developed to provide therapeutic utility. 

Selective activation of S1P transporters may 
have utility in cancer metastasis and anemia 
(34, 96). In metastasis, SPNS2 activation, which 
allows S1P export and signaling by immune cell 
S1PRs to regulate trafficking, may enhance
CD8+ T cell and NK cell infiltration in tumors
and potentially enhance antitumor immunity. 
In anemia, activation of the MFSD2B trans- 
porter in RBCs may help reduce the concen- 
tration of sphingosine and S1P in the cell 
membranes of RBCs, which was shown to re- 
duce recovery from anemia in mice. 

The inhibition of the metabolic enzymes in-
volved in S1P metabolism may also have the-
rapeutic utility. An S1P lyase inhibitor induced
lymphopenia and suppressed autoimmune in-
flammation by enhancing S1P levels (Table 1)
(143). However, broad-spectrum sphingolipid
alterations induced by inhibition of S1P lyase
could lead to side effects. Similarly, even though
SPHK inhibitors have been tested in preclinical
models to inhibit the overactive SPHK1 enzyme
in cancer (144), recent work with highly specific
and potent inhibitors suggests that they have
limited utility, indicating that off-target effects
of early SPHK-targeted agents may be involved
(145). SPHK2 inhibitors led to increased plasma
S1P and attenuation of inflammatory and fib-
rotic responses (146). Because modulation of
S1P metabolic enzymes led to broad changes in 
the metabolite concentrations and flux of not 
only sphingolipids but also other phospholipids, 
this approach will need to carefully consider 
lipid metabolism before clinical application. 


Conclusions and future perspectives 


The use of mouse models and modern phar-
macological tools has led to our current un-
derstanding of the biology of the S1P-S1PR
signaling system. With the application of S1PR- 
targeted drugs in humans, much has been 
learned about the role of this pathway in the 
immune system and its potential future ap- 
plication in immunological, neurological, and 
perhaps cardiovascular and fibrotic diseases. 
S1PR1 functional antagonists that induce irre-
versible S1PR internalization and degradation
are the most advanced class of compounds,
whereas competitive antagonists have not yet
been tested in clinical trials. Functional
antagonism of the receptors may be prefer-
able for blocking S1PRs in ligand-rich compartments
and may improve efficacy in the clinic. How-
ever, the emerging biological understanding
and clinical research suggest additional oppor-
tunities for therapeutic application, including
transporter modulators, biased agonists of
S1PRs, chaperone-based strategies, and ligand
neutralization approaches. Such approaches,
which attenuate abnormal S1PR signaling and
normalize spatial S1P gradients, may be useful
to combat diseases in the context of the com-
plex S1P signaling mechanisms that are prev-
alent in many organ systems.

Table 1. S1P modulators under development or in clinical testing. RRMS, relapsing remitting multiple sclerosis; PPMS, primary progressive multiple sclerosis; SPMS, secondary progressive multiple sclerosis; RMS, relapsing multiple sclerosis; UC, ulcerative colitis; CIPD, chronic inflammatory demyelinating polyradiculoneuropathy; ALS, amyotrophic lateral sclerosis; SLE, systemic lupus erythematosus; SCLE, subacute cutaneous lupus erythematosus; RA, rheumatoid arthritis; AMD, age-related macular degeneration.
INTRODUCTION: Characterizing genetic variants
underlying local adaptations in human pop- 
ulations is one of the central goals of evolu- 
tionary research. Most studies have focused on 
adaptive single-nucleotide variants that either 
arose as new beneficial mutations or were in- 
troduced after interbreeding with our now- 
extinct relatives, including Neanderthals and 
Denisovans. The adaptive role of copy number 
variants (CNVs), another well-known form of 
genomic variation generated through deletions 
or duplications that affect more base pairs in 
the genome, is less well understood, despite 
evidence that such mutations are subject to 
stronger selective pressures. 


RATIONALE: This study focuses on the discovery of introgressed and adaptive CNVs that
have become enriched in specific human populations. We combine whole-genome CNV calling
and population genetic inference methods to discover CNVs and then assess signals of
selection after controlling for demographic history. We examine 266 publicly available
modern human genomes from the Simons Genome Diversity Project and genomes of three
ancient hominins: a Denisovan, a Neanderthal from the Altai Mountains in Siberia, and a
Neanderthal from Croatia. We apply long-read sequencing methods to sequence-resolve
complex CNVs of interest specifically in the Melanesians, an Oceanian population
distributed from Papua New Guinea to as far east as the islands of Fiji and known to
harbor some of the greatest amounts of Neanderthal and Denisovan ancestry.


[Figure: schematic of the out-of-Africa route to present-day Melanesia. The Neanderthal
haplotype at chromosome 8p21.3 (a 6-kbp deletion and a 38-kbp duplication; 44% in
Melanesians, ~0% in non-Melanesians) and the Denisovan haplotype at chromosome
16p12.2/16p11.2 (DUP16p12, 383 kbp; 79% in Melanesians, 0% in non-Melanesians) are each
compared with the human reference genome.]

Large adaptive-introgressed CNVs at chromosomes 8p21.3 and 16p11.2 in Melanesians. The magnifying
glasses highlight structural differences between the archaic (top) and reference (bottom) genomes. Neanderthal
(red) and Denisovan (blue) haplotypes encompassing large CNVs occur at high frequencies in Melanesians
(44 and 79%, respectively) but are absent (black) in all non-Melanesians. These CNVs create positively selected
genes (TNFRSF10D1, TNFRSF10D2, and NPIPB16) that are absent from the reference genome.

RESULTS: Consistent with the hypothesis of archaic introgression outside Africa, we find
a significant excess of CNV sharing between modern non-African populations and archaic
hominins (P = 0.039). Among Melanesians, we observe an enrichment of CNVs with potential
signals of positive selection (n = 37 CNVs), of which 19 CNVs likely introgressed from
archaic hominins. We show that Melanesian-stratified CNVs are significantly associated
with signals of positive selection (P = 0.0323). Many map near or within genes associated
with metabolism (e.g., ACOT1 and ACOT2), development and cell cycle or signaling (e.g.,
TNFRSF10D, CDK11A, and CDK11B), or immune response (e.g., IFNLR1). We characterize two of
the largest and most complex CNVs on chromosomes 16p11.2 and 8p21.3 that introgressed
from Denisovans and Neanderthals, respectively, and are absent from most other human
populations. At chromosome 16p11.2, we sequence-resolve a large duplication of >383
thousand base pairs (kbp) that originated from Denisovans and introgressed into the
ancestral Melanesian population 60,000 to 170,000 years ago. This large duplication
occurs at high frequency (>79%) in diverse Melanesian groups, shows signatures of
positive selection, and maps adjacent to Homo sapiens-specific duplications that
predispose to rearrangements associated with autism. On chromosome 8p21.3, we identify a
Melanesian haplotype of Neanderthal origin that carries two CNVs, a ~6-kbp deletion and a
~38-kbp duplication, and that introgressed into non-Africans 40,000 to 120,000 years ago.
This CNV haplotype occurs at high frequency (44%) and shows signals consistent with a
partial selective sweep in Melanesians. Using long-read sequencing genomic and
transcriptomic data, we reconstruct the structure and complex evolutionary history for
these two CNVs and discover previously undescribed duplicated genes (TNFRSF10D1,
TNFRSF10D2, and NPIPB16) that show an excess of amino acid replacements consistent with
the action of positive selection.

Read the full article at http://dx.doi.org/10.1126/science.aax2083


CONCLUSION: Our results suggest that large 
CNVs originating in archaic hominins and 
introgressed into modern humans have 
played an important role in local population 
adaptation and represent an insufficiently 
studied source of large-scale genetic varia- 
tion that is absent from current reference 
genomes. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: eee@gs.washington.edu 
Cite this article as P. Hsieh et al., Science 366, eaax2083

Copy number variants (CNVs) are subject to stronger selective pressure than single-nucleotide
variants, but their roles in archaic introgression and adaptation have not been systematically 
investigated. We show that stratified CNVs are significantly associated with signatures of positive 
selection in Melanesians and provide evidence for adaptive introgression of large CNVs at chromosomes 
16p11.2 and 8p21.3 from Denisovans and Neanderthals, respectively. Using long-read sequence data, 
we reconstruct the structure and complex evolutionary history of these polymorphisms and show that 
both encode positively selected genes absent from most human populations. Our results collectively 
suggest that large CNVs originating in archaic hominins and introgressed into modern humans have 
played an important role in local population adaptation and represent an insufficiently studied source of
large-scale genetic variation.


Upon migration out of their ancestral homeland, our ancestors likely experienced
pressure to adapt to new environments. Genomic surveys with single-nucleotide variants
(SNVs) have provided evidence for local genetic adaptations (1-4) and the acquisition of
variants introgressed from other hominins and subsequently under positive selection in
modern humans (5-7). Copy number variants (CNVs) are a form of structural variation,
including deletions, duplications, and multi-allelic CNVs, that are >50 base pairs (bp)
in length and affect at least five times as many base pairs as SNVs per diploid human
genome (8-10). When compared with the relative abundance of SNVs, structural variants
are three times as likely to be associated with a genome-wide association signal and 50
times as likely to be associated with expression quantitative trait loci (eQTL) (11).
Although larger CNVs are generally deleterious and associated with disease, examples of
adaptive CNVs in humans have been documented (12-15). However, relatively little is
known about the extent to which CNVs contribute to the genetic basis of local adaptation
and, more importantly, whether CNVs introgressed from other hominins may have been
targets of adaptive selection.

In this work, we systematically searched for 
genome-wide evidence for selective and ar- 
chaic introgressed CNVs among Melanesian 
genomes. Melanesia, a subregion of Oceania, 
is characterized by island populations distrib- 
uted across the Bismarck Archipelago and 
Bougainville Island, just to the east of New 
Guinea. Melanesians have likely developed ad- 
aptations to diet (16), infectious diseases (17), 
and body size (18) as a result of the tropical 
island environment. In addition, Melanesians 
have been relatively isolated over most of their 
history (>50,000 years), with major influences 
from populations to the west occurring only 
during the past 3000 years (19, 20). Melane- 
sians also carry some of the greatest amounts 
of ancestry from both known archaic hominins, 
Denisovans (3 to 5%) (21-23) and Neanderthals 
(1 to 3%) (24), allowing for insights into the 
role of introgression in human evolution. 


An excess of archaic hominin CNVs in 
non-African populations 


We began by constructing a database of 5135 
archaic CNVs (table S1) from previous se- 
quence read-depth analyses of genomes of 
three ancient hominins—a Denisovan (25), a 
Neanderthal (26) from the Altai Mountains in 
Siberia, and a Neanderthal from Croatia (24). 
We compared these archaic genomes to a di- 
versity panel of 17 high-coverage modern human 
genomes from the Simons Genome Diversity
Project (SGDP) (24). We genotyped these CNVs
in 249 genomes from the SGDP panel (27) and 
72 nonhuman great ape genomes (28) and 
then parsimoniously classified CNV events as 
lineage-specific or shared with modern humans 
and other ape species. We found that 2684 of 
the 5135 CNV loci in modern humans are copy 
number polymorphic in the archaic genomes 
[i.e., at least one of the three archaic genomes 
has a copy number estimate not equal to 2]; of 
these, 142 CNVs are likely derived relative to 
the 72 nonhuman primate genomes, which are 
fixed at a diploid copy number (CN2) for these 
loci. We defined a CNV as hominin specific if 
it is fixed in CN2 among all nonhuman pri- 
mate samples and is variable in copy number 
in at least one hominin sample. Among the 
402 hominin-specific CNVs (table S2), 13% (51 
of 402) of the CNVs are specifically shared 
between the three Eurasian archaic hominins 
and non-African samples. The amount of shared 
CNVs between the archaic and non-African 
samples is unlikely under a null expectation 
(P = 0.039, 100,000 permutation simula- 
tions; fig. S1). Despite the small sample size 
of three archaic genomes, our result is con- 
sistent with recent evidence for archaic ad- 
mixture found in Eurasia and suggests that 
some of the shared CNVs might result from 
archaic introgression. 


Discovery of single-nucleotide and 
population-stratified CNVs 


We performed a more comprehensive compu- 
tational analysis to discover and assess patterns 
of single-nucleotide and structural variation 
in the SGDP populations (29). We identified 
population-stratified CNVs and then tested 
for selection and introgression by using SNVs 
from the diploid sequence flanking population- 
stratified CNVs (29). Using five CNV callers, 
we discovered 368,256 autosomal CNVs, of 
which 93.5% were identified by a single CNV 
caller (fig. S2 and table S3). We focused on a
conserved call set of 19,211 CNVs constructed 
by applying a variety of quality control filt- 
ers (fig. S3) (29). We used orthogonal single- 
nucleotide polymorphism microarray data to 
determine an overall validation rate of 83.5% 
(68.2% for duplications and ~100% for dele- 
tions) (table S4) (29). 

To identify population-stratified CNVs, we applied three statistics quantifying the
differences in copy number between a focal population and all the other SGDP samples:
V_ST, Mann-Whitney U (MWU), and D_median (29). After removing admixed individuals
(fig. S4), we grouped the remaining SGDP samples (n = 249 individuals) into eight focal
populations: sub-Saharan Africans (n = 33), Native Americans (n = 20), East Asians
(n = 47), Europeans (n = 51), Melanesians (n = 16), Middle Easterners (n = 22), South
Asians (n = 38), and Siberians (n = 22) (29). We found little to no correlation between
the focal population sample size and the number of stratified CNVs identified by V_ST
(r = -0.16, P = 0.699) and the MWU test (r = -0.19, P = 0.638) compared with those
obtained by the D_median test (r = -0.73, P = 0.039) (fig. S5). In addition, there is a
strong correlation between the Bonferroni P values of the MWU and V_ST tests (r = -0.65,
P < 2 x 10°"); however, 233 CNVs have V_ST > 0.1 but with MWU Bonferroni's P > 0.05,
indicating that the two tests are complementary. To conservatively identify stratified
CNVs in a focal population, we used the following criteria: (i) V_ST > 0.1,
(ii) Bonferroni P value of the MWU test < 0.05, and (iii) D_median > 0.5 (fig. S6). Although
the number varied by population (table S5), we 
observed more stratified CNVs per population 
than expected by chance (P < 0.0105, 10,000 
nonparametric permutation simulations) (29), 
suggesting that the enrichment of stratified 
CNVs is unlikely to be the result of sampling 
errors. Because Melanesians share the most 
introgressed alleles with archaic lineages (an 
estimated 2 to 4% from both Denisovan and 
Neanderthal), we focused on this population 
for a more detailed investigation of introgressed 
and selected CNVs. 
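
The stratification criteria above can be illustrated with a minimal sketch. It assumes the commonly used definition V_ST = (V_T - V_S) / V_T, where V_T is the copy-number variance across all samples and V_S is the size-weighted mean of the within-group variances; the text does not spell out its formula, so this definition is an assumption. The function names, the toy copy numbers, and the convention of passing the MWU P value and the third statistic in precomputed are likewise hypothetical.

```python
from statistics import pvariance

def vst(focal, others):
    """V_ST for one CNV: (V_T - V_S) / V_T (assumed definition).

    focal, others: copy-number estimates for the focal population
    and for all remaining samples (hypothetical inputs).
    """
    pooled = list(focal) + list(others)
    v_total = pvariance(pooled)  # variance across all samples
    if v_total == 0:
        return 0.0  # no copy-number variation at this locus
    # within-group variance, weighted by group size
    v_within = (len(focal) * pvariance(focal)
                + len(others) * pvariance(others)) / len(pooled)
    return (v_total - v_within) / v_total

def is_stratified(v_st, mwu_p_bonferroni, d_stat):
    """Apply the three thresholds quoted in the text to precomputed values."""
    return v_st > 0.1 and mwu_p_bonferroni < 0.05 and d_stat > 0.5

# Toy data: a duplication common in the focal group and rare elsewhere.
focal_cn = [3, 3, 4, 3, 2, 3]
other_cn = [2, 2, 2, 2, 2, 3, 2, 2]
score = vst(focal_cn, other_cn)
print(round(score, 3), is_stratified(score, 0.01, 1.0))
```

Because population variances are used, the total variance splits exactly into within-group and between-group parts, so this V_ST always lies between 0 and 1.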


Tests for selective and archaic introgressed 
CNVs in Melanesians 


We considered two possible origins for the 
population-stratified CNVs: The events could 
have arisen de novo or, alternatively, could have 
introgressed from other hominins and then 
subsequently risen to high frequency by the 
action of natural selection or demographic 
processes, including drift. To distinguish be- 
tween these hypotheses, we applied a variety 
of population genetic statistics that use both 
allele frequency and linkage disequilibrium 
information of SNVs flanking the CNVs for 
additional evidence of archaic hominin intro- 
gression and/or signatures of positive selec- 
tion (29). Potential biases on the test statistics 
owing to population history were controlled 
through coalescent simulations on the basis of 
the inferred best-fit demographic models for 
Melanesians. We define the significance of each 
test as the fraction of simulations with test 
statistic values greater than or equal to the 
observed value in the real data (29). A test for 
a window is significant if its P value is <0.05. 
Among the models we tested (table S6), our best-fit model (fig. S7; log-likelihood =
−103,386) estimates that the ancestors of Africans and non-Africans diverged ~74,000
years ago [95% confidence interval (CI): 73,321 to 75,199 years ago], followed by
Melanesian-East Asian divergence ~52,000 years ago (95% CI: 51,450 to 52,591 years ago),
consistent with recent reports (19). Our demographic model confirms moderately high gene
flow from East Asians to Melanesians [95% CI for N_AMH × m_MEL-EA = 1.109 to 1.128 > 1
(N_AMH, effective population size of anatomically modern humans; m_MEL-EA, migration
rate from East Asia to Melanesia); table S7]. This finding is notable because there is
evidence for East Asian gene flow into Melanesians, especially in the lowland groups,
which constitute approximately one-third of our Melanesian sample set (19, 20).
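
The significance rule stated above (the fraction of simulations whose statistic is greater than or equal to the observed value) can be sketched as follows. The function name `empirical_p` is hypothetical, and the random draws merely stand in for simulated null statistics; they are not output from a coalescent simulator or from the demographic model described here.

```python
import random

def empirical_p(observed, simulated):
    """Fraction of simulated statistics >= the observed value."""
    if not simulated:
        raise ValueError("need at least one simulated value")
    hits = sum(1 for s in simulated if s >= observed)
    return hits / len(simulated)

# Stand-in null distribution (assumption): Gaussian draws, NOT a
# demographic simulation.
rng = random.Random(0)
null_stats = [rng.gauss(0.0, 1.0) for _ in range(10_000)]
p = empirical_p(2.1, null_stats)
print(p, p < 0.05)  # a window is called significant when P < 0.05
```

With this definition the smallest nonzero P value is one divided by the number of simulations, so the number of simulations bounds the resolution of the test.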

Our coalescent simulations (fig. S8 and table 
S8) (29) generally recapitulated the empirical 
variation pattern of SNVs (fig. S9A) and dem- 
onstrated the utility of our approach to iden- 
tify non-neutrally evolved loci (fig. S9, B and 
C). We used these parametric coalescent sim- 
ulations to estimate significance of selection 
and introgression at individual loci. Among Melanesians, we identified signatures of
positive selection at 37 distinct CNV loci [P < 0.05, population branch statistic (PBS)]
(2) (Fig. 1) and signals of introgression [f_D test (30)] at 24 and 28 CNV loci using
Neanderthal and Denisovan genomes as archaic references, respectively. Notably, more
than 51% of the selective CNV candidates (19 of 37 CNVs) also have introgression signals
at the flanking diploid sequences (Table 1 and tables S9 and S10), an observation that
is highly unlikely by chance (P = 0.0004, 10,000 nonparametric permutations). However,
we hypothesize that this
strong correlation is likely driven by the as- 
sociation between population-stratified CNVs 
and selection signals at their flanking sequences, 
because our analysis shows that Melanesian- 
stratified CNVs are significantly associated 
with selective signals (P = 0.0323, 10,000 non- 
parametric permutations) but not with archaic 
introgression signals (P = 0.2282, 10,000 non- 
parametric permutations) (fig. S10). 
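
For readers unfamiliar with the population branch statistic, a minimal sketch of its usual form follows: pairwise F_ST values are transformed to T = -ln(1 - F_ST), and PBS for the focal population is (T_focal,B + T_focal,C - T_B,C) / 2. The text does not state which two comparison populations were used, so the arguments below are generic and the numeric inputs are made up for illustration.

```python
import math

def branch_length(fst):
    """Transform pairwise F_ST into T = -ln(1 - F_ST)."""
    if not 0.0 <= fst < 1.0:
        raise ValueError("F_ST must be in [0, 1)")
    return -math.log(1.0 - fst)

def pbs(fst_focal_b, fst_focal_c, fst_b_c):
    """Population branch statistic for the focal population.

    fst_focal_b, fst_focal_c: F_ST between the focal population and
    comparison populations B and C; fst_b_c: F_ST between B and C.
    """
    t_fb = branch_length(fst_focal_b)
    t_fc = branch_length(fst_focal_c)
    t_bc = branch_length(fst_b_c)
    return (t_fb + t_fc - t_bc) / 2.0

# Made-up values: the focal population is strongly differentiated
# from both comparison populations, which are similar to each other.
print(round(pbs(0.40, 0.45, 0.05), 3))
```

A large PBS indicates that allele frequency change is concentrated on the branch leading to the focal population, which is why it is read as a signal of local positive selection.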

Among the adaptive introgressed CNV candidates, many map near or within genes associated
with metabolism (e.g., ACOT1 and ACOT2), development and cell cycle or cell signaling
(e.g., TNFRSF10D, CDK11A, and CDK11B), or immune response (e.g., IFNLR1); however, the
patterns are complex. For example, whereas the ACOT1 locus harbors a deletion
polymorphism common in East Asians (31, 32), we find a ~30-kbp duplication of ACOT1 in
only 12 SGDP samples, including three Melanesians, thus indicating a multi-allelic CNV
(figs. S11 and S12). ACOT1 is involved in regulating cellular balance between free fatty
acids and acyl-CoAs, which are critical for energy expenditure and neuronal function
(37). Although the ACOT1 deletion allele is common across populations, we found that
>84% of Melanesians retain the ACOT1 sequences (fig. S12). Furthermore, the major
Melanesian haplotype occurs at a much lower frequency in other populations (<11%) and
shows extended haplotype homozygosity (EHH), a signature of positive selection (figs.
S11, S13, and S14). Although a similar EHH pattern was observed in other non-Africans,
most individuals in those populations carry a different allele at the core (figs. S11
and S14). Notably, the introgression signal in Melanesians at this region is located on
the minor haplotype, not the major haplotype where we observe the strongest signature of
selection (figs. S13 and S14). The core SNVs (rs4903119 and rs8015976; fig. S11) on the
major haplotype are also associated with eQTL differences in ACOT1, ACOT2, and the
adjacent gene, HEATR4 (GTEx Portal accessed on 24 June 2019; https://gtexportal.org),
although additional work will be required to fully define these associations and their
potential biological consequences.

It is also noteworthy that our analysis iden- 
tified CNVs and structural variants with strong 
signals of selection that have been observed 
in other human populations (table S9) (29). 
For example, we identify a deletion-linked hap- 
lotype at the APOBEC3A and APOBEC3B locus 
that is fixed in Melanesians but relatively rare 
in other populations (figs. S15 to S19). This 
deletion allele has been reported at high 
frequency among Oceanians (32) and is asso- 
ciated with the risk of HIV-1 acquisition (33) 
as well as breast and ovarian cancers in some 
Eurasian populations (34). At another immunity-related locus, the alpha-defensin
(DEFA1-T1) gene family, we find that 87.5% of Melanesians have more DEFA1-T1 copies
than other populations (figs. S20 to S23). They also carry a
distinctive haplotype that is rare elsewhere 
(figs. S23 to S25). This locus is one of the most 
structurally dynamic regions in the human 
genome, where recurrent rearrangements have 
occurred during primate evolution and have 
been associated with diseases such as immu- 
noglobulin A nephropathy (35, 36). At chromo- 
some 17q21.31, we observe signals of selection 
and introgression. A large inversion and recur- 
rent duplications in this region have been 
associated with positive selection and pre- 
disposition to disease in Europeans (37, 38). 
Among Melanesians, we find a nearly fixed 
haplotype (97%) that is more reminiscent of 
the ancestral haplotype, which is devoid of 
both the large inversion and the duplication 
alleles (figs. S26 to S28). The Melanesian haplotype shows multiple signals of positive
selection, including significantly negative Tajima's D (-2.14, P = 0.007), low
nucleotide diversity (π < 4.3 x 10°, P = 0.042), and elevated EHH
across this region (figs. S29 to S32).
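
To make the Tajima's D signal concrete, the sketch below implements the standard estimator, D = (π - S/a1) / sqrt(e1·S + e2·S(S - 1)), on aligned haplotype strings, where π is the mean number of pairwise differences and S is the number of segregating sites. The text does not describe how D was computed, so this is only an illustration of the statistic; the function names and toy sequences are made up.

```python
from itertools import combinations
import math

def mean_pairwise_diff(haplotypes):
    """Average number of differing sites over all pairs of sequences."""
    pairs = list(combinations(haplotypes, 2))
    total = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs)
    return total / len(pairs)

def tajimas_d(haplotypes):
    """Tajima's D for equal-length aligned haplotypes (n >= 4)."""
    n = len(haplotypes)
    if n < 4:
        raise ValueError("need at least four sequences")
    if len({len(h) for h in haplotypes}) != 1:
        raise ValueError("sequences must be aligned to equal length")
    # S: number of segregating (polymorphic) sites
    seg = sum(1 for column in zip(*haplotypes) if len(set(column)) > 1)
    if seg == 0:
        return math.nan  # D is undefined without polymorphism
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    pi = mean_pairwise_diff(haplotypes)
    return (pi - seg / a1) / math.sqrt(e1 * seg + e2 * seg * (seg - 1))

# Toy alignments: an excess of rare variants versus two balanced haplotypes.
singletons = ["TAAAA", "ATAAA", "AATAA", "AAATA", "AAAAT", "AAAAA"]
balanced = ["AAAA"] * 3 + ["TTTT"] * 3
print(tajimas_d(singletons) < 0, tajimas_d(balanced) > 0)
```

An excess of rare variants, as expected after a selective sweep, pushes D below zero, which is the direction of the signal reported for the Melanesian haplotype.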

Although each of these loci will require more 
detailed investigation, we focus here on two of 
the largest and most complex copy number 
polymorphisms discovered among the Mel- 
anesians. Given that the DNA of the Melane- 
sian genomes in the SGDP panel comes from 
cell lines, which are subject to somatic arti- 
facts, we designed and performed experiments 
using additional blood-derived Melanesian DNA 
samples for several CNVs (fig. S33), as well as the 
two most complex CNV loci reported here, to 
safeguard against cell line artifacts.

Fig. 1. Candidates for introgressed archaic and selective CNVs. Joint distributions of P values for CNV stratification (x axis, Mann-Whitney U), archaic
introgression (f_D statistic, top two rows), and positive selection (population branch statistic, bottom row) tests. The archaic reference sequences used in the
calculations of f_D are Neanderthals (top row) and Denisovans (middle row), respectively. CNVs that show signatures of both positive selection and archaic
introgression (red circles) are distinguished from loci that show signatures of introgression only (orange circles).


Adaptive introgression of the
Melanesian-Denisovan-specific duplication
at chromosome 16p11.2

The top two Melanesian-stratified CNVs (P = 2.5 x 10°") are a 5-kbp
(chr16:21,596,722-21,601,720) duplication variant and a 73.5-kbp
(chr16:22,710,041-22,783,558) duplication variant (fig. S34) originating from chromosome
16p12.2. Both events are largely restricted to Melanesians and the Denisovan archaic
genome (Fig. 2A) and are thought to be involved in a single >225-kbp complex duplication
(DUP16p12) introgressed from the Denisovan genome (9). This region has been difficult to
correctly sequence and assemble, and only recently has the sequence structure of the
ancestral locus (>11 Mbp) been correctly resolved [KV880768.1, National Center for
Biotechnology Information (NCBI) BioProject: PRJNA31257; fig. S35] (39). Unless
otherwise stated, we use this patch of sequence to define the ancestral locus.

By using a series of polymerase chain reaction (PCR)-restriction digest assays to
genotype an independent set of 242 blood-derived DNA samples from diverse Melanesians
across eight different population groups (Fig. 2B and table S11), we confirmed that this
duplication allele is present at high frequency in Melanesians. Although DUP16p12 is
present at high frequency in all groups [allele frequency (AF) > 0.79], introgression is
nearly complete among lowland populations of West and East New Britain, Papua New Guinea
(Fig. 2B and table S11). Leveraging paralogous sequence variants that distinguish the
duplicated loci, we applied the f_D statistic and found significant signals for archaic
introgression at both loci (P < 0.033 and < 0.001; Fig. 1, Table 1, and table S10) using
Denisovans, but not Neanderthals (P = 0.178; fig. S36), as the source of admixture.

Table 1. Top candidates for adaptive introgressed CNVs in Melanesians. P values for selection and introgression scans are calculated from coalescent
simulations. Only loci with selection test P values <0.01 are listed. Variant type is determined with respect to the human reference genome. The introgression
statistic (f_D) is calculated separately for Neanderthals (NDL) and Denisovans (DNS). mCNV, multi-allelic CNV; DUP, bi-allelic duplication; DEL, bi-allelic deletion.
V_ST is a measurement of copy number stratification. See tables S9 and S10 for all candidates.

Locus                      Size (bp)   Type
chr2:97699365-97907043     207,678     mCNV
11:24520349-24523673       3324        DEL

*Significant test result.

Because the elevated PBS and Tajima's D at 16p12.2 (fig. S37) are driven by paralogous
sequence variants that are not fully resolved, we sought to further investigate evidence
for selection of DUP16p12 at its true genomic integration site. We performed a series of
fluorescence in situ hybridization (FISH) experiments using probes tiled along the short
arm of chromosome 16 in Melanesian cell lines (Fig. 2C, figs. S38 and S39, and tables
S12 and S13) (29). Our data localized the derived Melanesian duplication to chromosome
16p11.2 between genomic coordinates of 28.93 and 30.09 Mbp (Fig. 2D). In addition, we
generated long-read, high-coverage (75x) whole-genome sequences from a Melanesian
individual (HGDP00550; cell line from the HGDP-CEPH panel) who carries the DUP16p12
variant. Using the long-read data, we further narrowed the insertion location of this
duplication to a 200-kbp interval (29.48 to 29.68 Mbp) adjacent to an NPIP (nuclear
pore-interacting protein) core duplicon (fig. S40) (29). This is a complex region of
>500 kbp of segmental duplication where a Homo sapiens-specific duplication emerged
~280,000 years ago and, as a result, predisposes the region to recurrent structural
rearrangements associated with autism and developmental delay (15, 40).

To sequence-resolve the DUP16p12 copy number polymorphism, we generated a Melanesian
large-insert bacterial artificial chromosome (BAC) library (GM10539). From these BACs,
we constructed two haplotypes of 222 and 133 kbp, partially confirming the structure of
DUP16p12 (9) (figs. S41 and S42). To fully assemble the entire locus ab initio, we used
the haplotypes as the initial seeds to pull down long-read Melanesian whole-genome
sequence data and iteratively applied the Segmental Duplication Assembler method (41)
(fig. S43 and table S14). The procedure generated a ~1.8-Mbp sequence contig spanning
more than 900 kbp of complex segmental duplications (Fig. 3A). We confirmed the
organization and sequence accuracy (99.86%) for 220 kbp of this assembly for which we
have finished BAC sequences derived from an unrelated Melanesian sample (fig. S44).
Notably, the sequence-resolved assembly shows that the actual length of the DUP16p12
duplication polymorphism is ~383 kbp, which is longer than previously thought (9).
Sequence and phylogenetic analyses suggest that the variant originated from a series of
complex structural changes involving duplication, deletion, and inversion events ~0.5 to
2.5 million years ago (Mya) within the Denisovan ancestral lineage, which subsequently
inserted into chromosome 16p11.2 (chr16:29,640,235-29,640,459) between 0.2 and 0.5
million years ago (Fig. 3B, figs. S45 and S46, and table S15).

†Newly identified genes: TNFRSF10D1 and TNFRSF10D2.
††Newly identified gene: NPIPB16.

Within the sequence immediately flanking the 16p11.2 duplication block near the DUP16p12
insertion breakpoint, we observe significantly elevated PBS (P < 0.012) and f_D
(P < 0.021, archaic = Denisovan) values (Fig. 2D). Once again, the introgression signal
at 16p11.2 is absent when Neanderthals are used as the archaic reference [P(f_D; archaic
= Neanderthal) > 0.193; fig. S47], consistent with a Denisovan origin. To infer the
timing of the introgression event, we constructed DUP16p12 haplotypes for the Denisovan
and Melanesian genomes by remapping the short-read data to the assembled Melanesian
contig and KV880768.1 in addition to the human reference. We reconstructed the phylogeny
of DUP16p12 using ~10-kbp sequences, where at least five high-quality Denisovan reads
are present, and showed that the introgression event likely occurred ~0.06 to 0.17
million years ago (fig. S48).

The Melanesian duplication polymorphism harbors extra copies of segmental duplication
sequences that are absent from most human populations, including an additional member of
the NPIP family (42). To explore the NPIP coding potential at this locus, we generated
full-length nonchimeric (FLNC) transcript data using Melanesian (GM10539 and GM10541)
fibroblast cell lines (29). We identified FLNC transcripts that maintain the same open
reading frame (ORF) and encode a previously unidentified member of the NPIPB family,
NPIPB16 (1206 amino acids), mapping exclusively to this duplication polymorphism
(Fig. 3C). This Melanesian copy shows elevated pairwise dN/dS ratios (i.e., the ratio of
the number of nonsynonymous substitutions per nonsynonymous site to the number of
synonymous substitutions per synonymous site) when compared with other closely related
NPIPB genes (RefSeq release 109) (figs. S49 and S50). Using a phylogenetic branch site
test (43), we identified 32 sites as likely positively selected, including a cluster of
28 amino acid differences that occur in the last exon of NPIPB16 (Fig. 3C). Multiple
sequence alignment analyses reveal that this cluster is due to two indel events of a
repeat motif (GAGCGTCTGCGGG). The first indel upstream of the cluster alters the frame,
whereas the second indel located downstream subsequently restores the original NPIP
frame. These compensatory mutations resulted in a novel sequence at the C terminus of
NPIPB16 (fig. S51), as opposed to the other four amino acid replacement events. Notably,
at 30 of the 32 sites where there is sequence coverage (more than five reads), the
Denisovan genome carries the same alleles as in the Melanesian NPIPB16, suggesting that
most of these events emerged in the Denisovan lineage.

Taken together, our findings suggest that the 383-kbp duplication polymorphism in
Melanesians introgressed from Denisovan-like hominins and that it is likely an adaptive
CNV. This helps to explain why this polymorphism has become nearly fixed within the
Melanesian populations (>80%) despite its large size, which is typically regarded as
selectively disadvantageous (8, 44). Our analyses support a model where the locus on
chromosome 16p11.2 served as an acceptor site for large independent duplication events
twice in the recent history of the Homo genus: once in the H. sapiens lineage and once
in the ancestral Denisovan lineage where novel duplicate genes have emerged. Notably,
the Melanesian-specific gene NPIPB16 shows ~3% amino acid divergence and evidence of
positive selection despite its recent origin. Introgression has generated an amalgam of
these two duplications, creating a massive duplication block (~900 kbp) in Melanesian
and related populations. Given the localization of this duplication polymorphism to the
recurrent autism-associated breakpoint region, we predict that these changes in the
genomic architecture and the additional NPIPB homologous segments (orange triangles in
Fig. 3; see also table S16) will affect the frequency of recurrent rearrangements
associated with autism in these populations.

[Fig. 2 image. Recoverable labels: populations NDL, DNS, SIB, SA, MEL, ME, EUR, EA, AMR,
AFR; copy-number classes CN1 to CN4; FISH panels GM12878 (EUR), chr16p12.2,
CN(DUP16p12) = 2 and GM10541 (MEL), chr16p12.2, CN(DUP16p12) = 3; SNV classes enhancer,
promoter, synonymous, nonsynonymous; gene track across chr16p11.2.]

Fig. 2. Evidence for adaptive Denisovan introgression of the chromosome
16p12.2 duplication at 16p11.2 in Melanesians. (A) Copy number (CN)
estimates of DUP16p12 for the SGDP populations and three archaic samples.
Pie charts indicate the CN frequency of populations (right) and the population
fraction of CN genotypes (bottom). NDL, Neanderthals; DNS, Denisovans;
SIB, Siberians; SA, South Asians; MEL, Melanesians; ME, Middle Easterners;
EUR, Europeans; EA, East Asians; AMR, Native Americans; AFR, sub-Saharan
Africans. (B) Geographic distribution for the DUP16p12 duplication genotypes
of 242 independent blood-derived DNA samples from Melanesia. The CN
color scheme matches that in (D). (C) FISH experiments using fosmid clones
from 16p12.2 confirm an additional copy of DUP16p12 (red fosmid clone,
174222_ABC10_2_1_000044550500_M3 at 16p12.2; table S13; fig. S38) in a
Melanesian cell line (GM10541, CN3) as opposed to a European cell line
(GM12878, CN2). (D) (Top) Signals of adaptive introgression in the Melanesians
at 16p11.2, the locus in which the DUP16p12 duplication was inserted. The heat
map shows the CN distribution at chromosome 16p11.2. Fosmid clones (green:
ABC10_000044688200_G16; blue: ABC10_000043626100_E12; table S13)
indicate the region where the integration of DUP16p12 occurred at the 16p11.2
locus. (Bottom) PBS (left y axis) for SNVs (dots) and f_D [horizontal lines,
representing windows of 100 SNVs, computed using Denisovans as the archaic
reference, right axis] at DUP16p12. Colored circles (blue) and/or horizontal lines
(purple) indicate significant test statistics (P < 0.05). Note that introgression
signals at both 16p12.2 and 16p11.2 disappear if Neanderthals are used as
the archaic reference in the f_D computation (figs. S36 and S47).


Introgression of Neanderthal CNVs 
at chromosome 8p21.3 shows signals 
of positive selection 


Another signal of selection (PBS/F_ST) and introgression (P < 0.005) in
Melanesians maps to a 76-kbp region on chromosome 8
(chr8:22,969,611-23,045,069; Fig. 4, A and B, and fig. S52). Within this
region, there are two stratified CNVs: a 6-kbp deletion (DEL_MEL-NDL,
chr8:22,982,302-22,988,251; CNV stratification P < 8.9 × 10^[...]) and a
31-kbp duplication (DUP_10D, chr8:22,991,347-23,022,738; CNV
stratification P < 1.5 × 10^[...]) encompassing TNFRSF10D (tumor necrosis
factor receptor superfamily 10D). The copy number estimates of both
structural variants (DEL_MEL-NDL and DUP_10D) are highly correlated among
Melanesian samples (Pearson's correlation r = -0.64, P < 0.05) but not in
other SGDP populations (fig. S53). Multiple lines of evidence, including
signals of excess heterozygosity, sequence read-depth analyses, and FISH
assays, confirm that the duplication variant DUP_10D is the major allele
among Melanesians (AF = 0.66) but is also found among a subset of Africans
(AF = 0.12) as well as in the three archaic genomes where it is
heterozygous (AF = 0.5) (Fig. 4B and figs. S53 to S55). The deletion
allele DEL_MEL-NDL shows a restricted distribution, as it is observed
among Melanesians (AF = 0.437), the two Neanderthals (AF = 1.0), and at
low frequency among South Asians (AF = 0.013). Using a PCR assay, we
confirmed the presence of the DEL_MEL-NDL deletion allele in two
independent sets of cell line and blood-derived Papuan DNA samples
(fig. S56 and table S12) (29).

To understand the evolution of DUP_10D, we computed its copy number among
great apes (28) and samples from the 1000 Genomes Project (1KG) (10). The
DUP_10D variant is present among all nonhuman primates examined and
segregates at low frequencies (<0.025) in some 1KG populations but is
completely absent from others, such as the Europeans (figs. S53 to S55 and
S57). We used BACs to sequence the DUP_10D locus from chimpanzees,
gorillas, orangutans, and rhesus macaques (29). Our sequence comparison
analyses showed that all nonhuman ape species share the same tandem
organization for the two copies of TNFRSF10D (TNFRSF10D1 of 30.3 kbp and
TNFRSF10D2 of 33.0 kbp) (Fig. 4C and fig. S58). The absence of the tandem
structure from the human reference genome and most extant humans suggests
that the single-copy structure of TNFRSF10D emerged specifically in the
human lineage.

Using the chimpanzee sequence, we inferred the evolutionary breakpoints at
chr8:23,003,123-23,003,255 (GRCh37) partially overlapping the fifth intron
and exon of TNFRSF10D. We predict that the most common derived form of
TNFRSF10D in humans is a product of a nonallelic homologous recombination
event between two ancestral genes (TNFRSF10D1 and TNFRSF10D2). The
resulting gene fusion creates a hybrid consisting of the first five exons
of TNFRSF10D2 and the last four exons of TNFRSF10D1. Thus, two-thirds of
TNFRSF10D1 (~18.4 kbp) and one-third of TNFRSF10D2 (~11.9 kbp) are absent
from most humans (figs. S58 and S59). Bayesian phylogenetic reconstruction
of the loci shows that two independent duplication events of TNFRSF10D
occurred in primate evolution: one in the lineage leading to the Old World
monkey and the other in the ancestral lineage of all apes ~27.6 million
years ago (95% highest posterior density: 19.9 to 36.1 million years ago;
fig. S60). In contrast, we estimate that the gene fusion occurred 1.1 to
1.5 million years ago in the ancestral Homo lineage leading to modern
humans, Neanderthals, and Denisovans, where it rose to high frequency
among most human populations, with the exception of Melanesians.

Fig. 3. Reconstruction of the structure and evolutionary history for the
Melanesian-Denisovan duplication at chromosome 16p11.2. (A) Structural
comparison of chromosome 16p11.2 (human genome reference GRCh37; top), the
structure-resolved Melanesian contig (middle) at 16p11.2, and the
ancestral locus of DUP_16p12 at 16p12.2 (KV880768.1; NCBI BioProject:
PRJNA31257; bottom). Colored boxes denote annotated human segmental
duplications, and lines connecting the sequences show regions of homology.
Duplicated segments specific to the Melanesian genome (red dashed box) are
indicated if derived from unique (colored arrows) or previous duplication
(colored rectangles) sequences. The region of recurrent genome
rearrangements associated with autism is highlighted (pink shaded area).
(B) Schematic model for the evolution of the DUP_16p12 duplication. The
schematic depicts structural changes over time, leading to the Melanesian
architecture. Evolutionary timing was estimated on the basis of a series
of phylogenetic analyses using structure-resolved sequences from 16p12.2
and 16p11.2 loci (31). The absence of intermediate genomes makes the order
of some structural changes uncertain. (C) A new member of the NPIPB gene
family, NPIPB16 (1206 amino acids), in the Melanesian DUP_16p12 sequence
with predicted sites of positive selection. dN/dS analyses show positively
selected amino acid substitutions at NPIPB16 lineages (blue circles)
compared with other NPIPB genes. Note that the cluster of massive amino
acid changes (red circles) at position 1236 to 1284 (alignment space) is
predicted to result from two indel events in the C terminus of NPIPB16 as
opposed to a series of independent amino acid substitution events
(fig. S51).

Fig. 4. Highly stratified CNVs at 8p21.3 in Melanesians and evidence for
gene duplication and fusion followed by adaptive evolution at the
TNFRSF10D locus. (A) Manhattan plot for the P values of window-based F_ST
test. The horizontal dashed line indicates the genome-wide
Bonferroni-corrected significance. (B) (Top) Distribution of heterozygous
sites (short black vertical bars) for a subset of the SGDP samples. The
gray box at the top shows the location of DEL_MEL-NDL, whereas the
blue-red box indicates the derived TNFRSF10D form, a fusion of TNFRSF10D1
(blue) and TNFRSF10D2 (red), as shown in Fig. 3C. (Bottom) Distributions
of f_D and PBS statistics, as well as CN trajectories of all samples
across the region. (C) Comparison (Miropeats) of the major human allele
versus chimpanzee genome structure, showing the tandem organization of the
DUP_10D variant and the predicted gene models. (D) Branch-site test of
positive selection (dN/dS) using FLNC transcript data shows significant
selection signals (P = 0.005) compared with the null model and a cluster
of positively selected amino acid substitutions at the transmembrane
domain of TNFRSF10D. Coding-sequence phylogeny shows significant positive
selection (orange; dN/dS ratios > 1; P < 0.05) on specific branches. Note
that the orangutan paralogous sequences form a single clade as a result of
interlocus gene conversion (fig. S60).

We assessed the coding potential of the three different TNFRSF10D models
by sequencing full-length transcripts from Melanesian (GM10541, CN3),
European (GM12878, CN2), and chimpanzee (PanTro, CN4) cell line samples
(29). In chimpanzee cells, we identified a six- and a nine-exon gene model
corresponding to TNFRSF10D1 and TNFRSF10D2, respectively (figs. S61 and
S62). In contrast, the European CN2 sample possesses only the fusion gene,
TNFRSF10D, whereas the heterozygous Melanesian sample shows all three gene
models, including the two additional models present in chimpanzee
(fig. S61). Although ORF analyses indicate that TNFRSF10D1 carries a
premature stop codon in exon 2, which truncates the canonical protein at
the 59th amino acid position, a protein with 217 amino acids is still
predicted, owing to the use of a second start codon in exon 2 (fig. S62).
In contrast, TNFRSF10D2 maintains an ORF with all nine exons consistent
with the annotation in the human reference assembly (ENST00000312584). The
more common human fusion gene effectively deletes the TNFRSF10D1 premature
stop codon (fig. S62) and thus restores the canonical ORF.

To further assess the functional importance of the TNFRSF10D duplicates,
we examined FLNC transcript data from Melanesian (GM10541), European
(GM12878), and chimpanzee fibroblast cell lines (29) and inferred
orthologous gene models from TNFRSF10D1 and TNFRSF10D2 using gorilla,
orangutan, and rhesus macaque BAC sequences. We observed a significant
excess of amino acid substitutions for the three gene models, especially
among the African great ape lineages (fig. S63) (29). Using a branch-site
test (43), we find evidence of positive selection for both the fusion gene
and the TNFRSF10D1 copy on the human lineage and, more broadly, for both
D1 and D2 in other nonhuman apes (P = 0.005; Fig. 4D and figs. S64 and
S65). Specifically, we identify a cluster of positively selected sites
corresponding to the predicted transmembrane domain of the genes (Fig. 4D
and fig. S64).

In contrast to the TNFRSF10D duplication, the DEL_MEL-NDL variant is
absent from all nonhuman apes, suggesting that it is derived (fig. S53).
Using a PCR assay (table S17) and seven tagging SNVs [r² > 0.9 and
normalized coefficient of linkage disequilibrium (D') > 0.9; table S18],
we show that the DEL_MEL-NDL variant is largely restricted to the 1KG
South Asian populations where it occurs at low frequencies (<0.07; Fig. 5A
and figs. S57 and S66). All three Neanderthal genomes, including the
low-coverage Mezmaiskaya1 (24), are homozygous for the DEL_MEL-NDL
haplotype, suggesting that this deletion is fixed in Neanderthals
(table S18). We identify strong Neanderthal introgression signals at
DEL_MEL-NDL among Melanesians [P(f_D, archaic = Neanderthal) = 0.003 and
P(S*) = 0.043; Fig. 4B and fig. S67], as opposed to Denisovan
introgression [P(f_D, archaic = Denisovan) = 0.06; fig. S68]. Consistent
with this, we find that all 15 DEL_MEL-NDL-linked SGDP haplotypes are more
closely related to the Neanderthal haplotypes than any other human
haplotype (fig. S69), forming a monophyletic clade in both the haplotype
network (Fig. 5B) and maximum likelihood estimated phylogenetic tree
(log-likelihood = -21,578; fig. S70) (31). The time to the most recent
common ancestor of the Neanderthal and Melanesian DEL_MEL-NDL-linked
haplotypes is 40,000 years ago (95% CI: 0 to 122,000 years ago) and that
of all human DEL_MEL-NDL-linked haplotypes is 120,000 years ago (95% CI: 0
to 241,000 years ago); both are much younger than the divergence of modern
humans and Neanderthals. It is worth noting that the haplotype sharing of
the 18.5-kbp sequence between modern humans and Neanderthals is unlikely
under a simple model of incomplete lineage sorting (ILS) with reasonable
demographic parameters (P = 0.0403) (29).

Hsieh et al., Science 366, eaax2083 (2019)
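
The f_D values quoted above can be illustrated with a minimal sketch. It assumes the commonly used definition of f_D as a ratio of ABBA-BABA sums over derived-allele frequencies, with P1 a non-introgressed modern population, P2 the target population (here, Melanesians), P3 the archaic reference, and O the outgroup. The function names are hypothetical, and the authors' windowing (100 SNVs per window) and significance testing by coalescent simulation (29) are not reproduced.

```python
from typing import Sequence


def abba_baba_sum(p1: Sequence[float], p2: Sequence[float],
                  p3: Sequence[float], po: Sequence[float]) -> float:
    """Sum over sites of expected ABBA minus BABA pattern frequencies."""
    total = 0.0
    for a, b, c, o in zip(p1, p2, p3, po):
        abba = (1 - a) * b * c * (1 - o)
        baba = a * (1 - b) * c * (1 - o)
        total += abba - baba
    return total


def f_d(p1: Sequence[float], p2: Sequence[float],
        p3: Sequence[float], po: Sequence[float]) -> float:
    """f_D for one window of sites, given derived-allele frequencies.

    The denominator replaces P2 and P3 at each site with whichever of the
    two has the higher derived-allele frequency (assumed definition).
    """
    pd = [max(b, c) for b, c in zip(p2, p3)]
    numerator = abba_baba_sum(p1, p2, p3, po)
    denominator = abba_baba_sum(p1, pd, pd, po)
    if denominator == 0:
        return float("nan")  # statistic undefined for this window
    return numerator / denominator


if __name__ == "__main__":
    # Two toy sites: derived allele absent in P1, at 50% in P2, fixed in P3.
    print(f_d([0.0, 0.0], [0.5, 0.5], [1.0, 1.0], [0.0, 0.0]))
```

Under these assumptions, a window in which P1 and P2 carry the derived allele at equal frequency gives f_D = 0, whereas complete sharing between P2 and P3 gives f_D = 1.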

Because the large frequency difference of the DEL_MEL-NDL haplotype
between Melanesians and other populations can be due to the action of
natural selection, we specifically tested this hypothesis. Using the
deletion variant as a proxy for the beneficial allele and controlling the
age of the variant in coalescent simulations (29), we find that the
observed PBS value of the DEL_MEL-NDL deletion allele is significantly
higher than expected under a null model of neutral evolution (PBS = 0.933,
P = 0.0082; Fig. 5C). Among Melanesians, the deletion allele is always
associated with DUP_10D. This 6-kbp deletion corresponds to a histone H3
lysine 27 (H3K27) acetylation peak (skeletal muscle tissue), which by
ChIA-PET (chromatin interaction analysis by paired-end tag) analysis
associates with multiple members of the tumor necrosis factor receptor
superfamily, a gene family critical for apoptosis (45).
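
The PBS reported above can be illustrated with a minimal sketch. It assumes the standard three-population definition, in which each pairwise F_ST is converted to a branch length T = -ln(1 - F_ST) and the branch specific to the target population is PBS = (T_target,ref + T_target,out - T_ref,out) / 2. The function names and the choice of reference and outgroup populations are hypothetical and are not taken from the authors' pipeline (29).

```python
import math


def branch_length(fst: float) -> float:
    """Convert a pairwise F_ST into a branch length T = -ln(1 - F_ST)."""
    if not 0.0 <= fst < 1.0:
        raise ValueError("F_ST must lie in the interval [0, 1)")
    return -math.log(1.0 - fst)


def pbs(fst_target_ref: float, fst_target_out: float,
        fst_ref_out: float) -> float:
    """Population branch statistic for the target population.

    Arguments are the three pairwise F_ST values among a target, a
    reference, and an outgroup population (assumed labels).
    """
    t_tr = branch_length(fst_target_ref)
    t_to = branch_length(fst_target_out)
    t_ro = branch_length(fst_ref_out)
    return (t_tr + t_to - t_ro) / 2.0


if __name__ == "__main__":
    # Toy values: target strongly differentiated from both other groups.
    print(pbs(0.5, 0.5, 0.0))
```

A large PBS indicates that allele frequencies changed mainly on the branch leading to the target population; whether a given value is unusual still has to be judged against a neutral expectation, as the simulations described in the text do.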

From these observations, we developed a model to account for the recent
evolution of this 8p21.3 locus (Fig. 5D). Our data support a copy number
reduction of TNFRSF10D to create a fusion gene approximately 1.2 to 1.5
million years ago in the ancestral Homo lineage. The presence of the
ancestral TNFRSF10D1 and TNFRSF10D2 in some, but not all, modern human
populations likely emerged as a result of ILS. A 6-kbp deletion
(DEL_MEL-NDL) located ~5 kbp distal to the TNFRSF10D locus in a putative
regulatory region subsequently originated in the Neanderthal lineage,
which then introgressed back into the human lineage <120,000 years ago
through contact with ancestral non-Africans. We predict that the unusually
high frequency of the DEL_MEL-NDL-linked haplotype in Melanesians is a
result of natural selection. The ChIA-PET data suggest that this deletion
may affect the expression of nearby genes, such as TNFRSF10C and TNFRSF10D
(WashU Epigenome Browser: https://epigenomegateway.wustl.edu).


Discussion 


Despite the growing body of evidence for adaptive introgression in humans
(5-7), the extent and relative contributions of adaptive introgressed CNVs
in human evolution remain largely unexplored. By leveraging sequences from
archaic and modern human genomes, we identified adaptive introgressed CNV
loci, which we resolved and characterized. The enrichment of selective and
adaptive introgressed CNVs highlights their potential role in human
evolution. Although we cannot rule out potential "hitchhiking" effects due
to positive selection acting on nearby genes, the lack of other functional
mutations of large effect (i.e., nonsynonymous variants) around our
candidate loci suggests that these selection signals are driven by the
stratified CNVs identified here and possibly by the genes carried within.

One possible confounding factor in such an 
analysis is ILS; however, we believe that this 
has had limited effect for several reasons. First, 
the estimated times to the most recent common ancestor
of the Melanesian and archaic
haplotypes at the 16p11.2 and 8p21.3 loci are less
than 170,000 years, which is inconsistent with
ILS. Second, we account for Melanesian de- 
mographic history in determining thresholds 
of significance for the individual population 
genetic tests applied here. Finally, tests of 
positive selection using dN/dS are generally 
unaffected by ILS (46, 47) and thus cannot 
account for the signals of positive selection we 
observe. It should be noted that SNVs that tag 
the CNV genotypes at the candidate loci of se- 
lection in Melanesians are generally specific 
to Melanesians, with relatively little association 
in other SGDP populations (9). For example, in 
the case of the APOBEC3A and APOBEC3B dele- 
tion variant, we find 266 SNVs from the 110-kbp 
region in almost perfect linkage disequilib- 
rium with the deletion allele in Melanesians, 
but among other East Asian populations the 
association is weaker (fig. S19). 

Although our limited understanding of 
genotype-phenotype relationships makes it 
difficult to predict functional consequences of 
the CNVs at chromosomes 16p11.2 and 8p21.3, 
we note that chromosome 16p11.2 harbors ad- 
aptive signals associated with human-specific 
gene duplication expansion that may affect 
iron regulation in human embryonic develop- 
ment (15). This locus also exhibits an en- 
richment of complex recurrent structural 
rearrangements, which predisposes humans 
to the second most common genetic cause of 
autism, accounting for ~1% of patients (15, 40). 
In the case of the 8p21.3 CNVs, the Melanesian
DUP_10D allele is accompanied by a nearby
deletion (DEL_MEL-NDL) that originated in
Neanderthals and overlaps with known regulatory
signals in human skeletal muscle myoblasts
(H3K27Ac markers, UCSC Genome
Browser). Notably, TNFRSF10D has been reported
as one of the primate-specific genes
preferentially expressed in progenitor cells of
the human fetal neocortex (48).
Fig. 5. Complex evolutionary history and evidence of adaptive archaic
introgression at 8p21.3 in Melanesians. (A) Geographic distribution of the
DEL_MEL-NDL tag SNV, rs367585898, using the 1KG data. (B) Median-joining
network showing that all deletion-linked haplotypes form a single clade.
Colors represent different haplotypes (DEL and DUP refer to DEL_MEL-NDL
and DUP_10D, respectively). Edge width indicates the number of pairwise
differences between the joined haplotypes. (C) A highly unusual PBS of
DEL_MEL-NDL in the Melanesians, compared with those computed using
variants with similar age in neutral coalescent simulations, supports the
hypothesis of positive selection favoring this introgressed variant. CDF,
cumulative distribution function. (D) (Top) Schematic model for the
evolution of the DEL_MEL-NDL and DUP_10D variants among great ape
lineages. Dashed and dotted lines correspond to contemporary and
historical lineages, respectively. (Bottom) Pie charts show the numbers of
individuals from the 1KG and SGDP panels with or without the two CNVs.

Large genomic gains and losses have the potential to create genes that can
affect organismal phenotypes. With our increasing understanding of gene
flow between various archaic forms and our ancestors, it is tempting to
hypothesize that CNV introgression from other hominins may have played a
key role in helping humans migrating out of Africa adapt to new
environments by serving as a reservoir of beneficial alleles. In addition,
this study highlights the substantial large-scale genetic variation that
remains to be characterized in the human population and the need for
development of additional reference genomes that better capture the
diversity of our species and complete our understanding of human genes
(11).


Materials and methods summary


We downloaded paired-end Illumina data for 266 fully publicly available
human genomes via the SGDP website (27) and called SNVs and indels using
the HaplotypeCaller function in Genome Analysis Toolkit (GATK, v3.5).
After quality control filtering, these variants were merged with genotypes
from the three archaic genomes (http://cdna.eva.mpg.de/) using BCFtools
(v1.5). For the SGDP samples, CNV calls were generated using WHAMG, LUMPY
(v0.2.13), DELLY2 (v0.7.2), digital comparative genomic hybridization
(dCGH), and Genome STRiP (v2.00.1611). We applied only dCGH to the three
archaic genomes, owing to the lack of paired-end data in these genomes.
Population-stratified CNVs were identified using V_ST, MWU, and D_median
statistics, whereas signals of positive selection and archaic
introgression were determined using the PBS, f_D, and S* statistics
(2, 30, 49). Significance of these test results was determined from
coalescent simulations based on the best-fit demographic models inferred
from ∂a∂i (50) and ADMIXTURE (51). Phylogenetic analyses were performed
with a combination of RAxML (v8.2.10), BEAST (v2.5.0), and PAML (v14.9).
We used FISH, sequencing of BAC clone inserts, and long-read sequencing
data to validate CNVs and assess the breakpoints of complex
rearrangements. The high-coverage long-read sequence data from the
Melanesian sample was generated using PacBio single-molecule, real-time
(SMRT) sequencing technology. We followed the Iso-Seq protocol and used
PacBio sequencing (52) to generate the FLNC transcript data. Detailed
descriptions of materials and methods are available in the supplementary
materials (29).
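
As an illustration of how a population-stratification statistic for copy number can be computed, the sketch below assumes the widely used definition V_ST = (V_T - V_S) / V_T, where V_T is the variance in copy number across all samples from two populations and V_S is the size-weighted average of the within-population variances. The function name, the use of population (rather than sample) variance, and the toy copy numbers are assumptions for illustration; the exact implementation used in the study is described in the supplementary materials (29).

```python
from statistics import pvariance
from typing import Sequence


def v_st(cn_pop_a: Sequence[float], cn_pop_b: Sequence[float]) -> float:
    """V_ST between two populations from per-sample copy number estimates."""
    a = [float(x) for x in cn_pop_a]
    b = [float(x) for x in cn_pop_b]
    pooled = a + b
    v_total = pvariance(pooled)
    if v_total == 0:
        return 0.0  # no copy number variation at all, so no stratification
    # Within-population variance, weighted by population size.
    v_within = (len(a) * pvariance(a) + len(b) * pvariance(b)) / len(pooled)
    return (v_total - v_within) / v_total


if __name__ == "__main__":
    # Toy data: one population mostly at CN 4, the other mostly at CN 2.
    print(v_st([4, 4, 3, 4], [2, 2, 2, 3]))
```

Values near 1 indicate that almost all copy number variance lies between the two populations, and values near 0 indicate that the populations have similar copy number distributions.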
NEURODEVELOPMENT


Coordination between stochastic and deterministic 
specification in the Drosophila visual system 


Maximilien Courgeon and Claude Desplan* 


INTRODUCTION: Neuronal fate decisions must be coordinated across brain
regions so that the proper stoichiometry of distinct afferents and their
target neurons is established. Many sensory systems rely on stochastic
fate specification to increase their repertoire of neuronal types and/or
to establish the mosaic of distinct sensory receptors. Once these
stochastic fate decisions have been made at the periphery, they must be
conveyed to processing centers in the brain to ensure that the correct
information is retained. This in turn requires that the proper number of
target neurons of the proper type are generated and are connected to their
cognate sensory receptors. How these stochastically determined sensory
systems transmit their decision to their targets in the brain, as when red
(L) and green (M) cone photoreceptors connect to their appropriate bipolar
cells in the human retina or when randomly specified olfactory receptors
project to their correct glomeruli in the mouse olfactory bulb, has
remained unknown.


RATIONALE: In the Drosophila retina, different types of color-sensitive R7
photoreceptors with different ultraviolet spectral sensitivities are
stochastically specified and distributed. By contrast, neurogenesis of the
optic lobes that receive input from photoreceptors is highly
deterministic: A fixed number of neuroblasts produces sequentially the
same types of neurons following a rigid temporal program as well as a
spatial program of determination. We asked how the stochastic choices made
in the retina are propagated to the optic lobes, that is, how
stochastically specified photoreceptors project and connect to their
neuronal targets that are generated through a highly deterministic
program.

Circuit development in the Drosophila visual system. (A) During larval
development, distinct subtypes of R7 photoreceptors are stochastically
specified, whereas medulla neurons are specified through a deterministic
program. (B) In the adult optic lobe, every R7 is connected to a Dm8
neuron of the proper subtype. (C) Three steps allow the propagation of the
stochastic choice made by R7 in the retina to the optic lobe: Distinct Dm8
subtypes are prespecified and produced in excess independently of R7s.
Then, R7-Dm8 pairs of the correct subtype are matched, controlled by the
complementary cell adhesion molecules Dpr11 and DIPγ for the yR7-yDm8
pair. Finally, unmatched Dm8s are eliminated by apoptosis.


RESULTS: We identified distinct subtypes of the main postsynaptic target
of the ultraviolet-sensitive R7 photoreceptors, the Dm8 neurons. Each Dm8
subtype is specific to one of the three different subtypes of R7, and
their organization in the optic lobe mirrors the topographic organization
of R7s in the retina. The three subtypes of Dm8 are prespecified early in
neurogenesis by distinct neural progenitors, independently from (and
before) any photoreceptor input. These neurons are produced in excess, but
only when the correct photoreceptor-Dm8 pairs form do Dm8s [...] to
accommodate random variations in the production of the subtypes of R7s.
Those Dm8s that do not find their correct R7 partner are culled by
programmed cell death. We then showed that matching of one of the R7-Dm8
subtypes is mediated by two complementary cell adhesion molecules: Dpr11
in one of the subtypes of R7, and its cognate receptor DIPγ in the
corresponding subclass of Dm8. Loss of either molecule leads to the death
of that Dm8 subtype or its mispairing with the wrong R7 subtype; these
phenotypes are similar to the loss of the corresponding subtype of R7
photoreceptors. These mechanisms allow the qualitative and quantitative
matching of R7 subtypes with their target in the brain, and thus permit
the stochastic choice made in R7 to be propagated to the
deterministically specified downstream circuit to support color vision.

Read the full article at http://dx.doi.org/10.1126/science.aay6727


CONCLUSION: Our data reveal two fundamental 
principles in circuit formation: (i) The neuro- 
trophic theory suggests that neuronal survival 
depends on trophic factors provided by the tar- 
gets. We show that apoptosis is also involved in 
the quantitative and qualitative matching of tar- 
get neurons with their afferent sensory receptors, 
providing a direct link between correct synaptic 
partner choice and cell survival. (ii) As Sperry’s 
chemoaffinity theory proposed, we show that 
synaptic partner choice is controlled by two in- 
teracting cell adhesion molecules, which are part 
of two large families of interacting proteins. 
NEURODEVELOPMENT


Coordination between stochastic and deterministic 
specification in the Drosophila visual system 


Maximilien Courgeon and Claude Desplan* 


Sensory systems use stochastic fate specification to increase their repertoire of neuronal types. How 
these stochastic decisions are coordinated with the development of their targets is unknown. In the 
Drosophila retina, two subtypes of ultraviolet-sensitive R7 photoreceptors are stochastically specified. 
In contrast, their targets in the brain are specified through a deterministic program. We identified 
subtypes of the main target of R7, the Dm8 neurons, each specific to the different subtypes of R7s. 
Dm8 subtypes are produced in excess by distinct neuronal progenitors, independently from R7. After 
matching with their cognate R7, supernumerary Dm8s are eliminated by apoptosis. Two interacting cell 
adhesion molecules, Dpr11 and DIPγ, are essential for the matching of one of the synaptic pairs. These
mechanisms allow the qualitative and quantitative matching of R7 and Dm8 and thereby permit the 
stochastic choice made in R7 to propagate to the brain. 


Stochastic specification of neurons is a
common feature of many sensory systems (1).
In the vertebrate olfactory system, it is used
to increase the diversity of olfactory sensory
neuron types to a repertoire of more than
1400 in mice (2). In humans and Old World
monkeys, the stochastic
specification of cone cells is the basis of
the retinal mosaic responsible for trichromatic 
color vision (3, 4). A neuron that relies on an 
initial stochastic decision must stabilize its 
choice to maintain its proper identity and 
then inform its downstream target cells of 
its choice. The latter is essential for neurons 
because they need to connect to their proper 
targets to faithfully transmit information to 
processing centers. The mouse olfactory sys- 
tem offers the most noteworthy illustration 
of this matching problem: The ~1400 olfactory
neuron subtypes are randomly distributed 
within the olfactory epithelium (5), yet all 
olfactory neurons of the same subtype project 
to the exact same glomeruli of the olfactory 
bulb (6-8). 

In the Drosophila retina, a similar stochastic 
mechanism ensures the random distribution 
of photoreceptors with different spectral 
sensitivity (9, 10). The Drosophila compound 
eye is composed of ~750 unit eyes called 
ommatidia, each composed of eight photo- 
receptors. Ommatidia can be classified into dif- 
ferent subtypes according to the rhodopsins 
with different spectral sensitivity expressed 
by color photoreceptors R7 and R8. The main 
part of the retina is occupied by two types of 
ommatidia that are randomly distributed and 
stochastically specified [Fig. 1A; reviewed in
(11)]. In the yellow (y) type that represents 65%
of ommatidia, R7 expresses the ultraviolet
(UV)-sensitive Rh4, whereas R8 (located below R7,
and thus seeing the same point in space)
always expresses the green-sensitive Rh6. In
the remaining 35% of ommatidia of the pale 
(p) subtype, R7 expresses the shorter UV- 
sensitive Rh3 and R8 expresses the blue- 
sensitive Rh5. A third type of ommatidia called 
dorsal rim area (DRA) is localized in the most 
dorsal row of ommatidia (12). In this subtype, 
both R7 and R8 express Rh3 and are respon- 
sible for detecting the polarized skylight used 
for navigation (13). 

Most of the gene regulatory network con- 
trolling the establishment of the fly retinal 
mosaic has been uncovered (17) (Fig. 1B). The 
stochastic fate decision is initially made by R7 
and is controlled by the transcription factor 
Spineless (Ss). Ss is stochastically turned on in 
65% of R7s that then adopt the yR7 fate (10, 14). 
Once this decision is made cell-autonomously 
by R7, it is propagated to R8 in the same 
ommatidium so that R7 and R8 have coupled 
rhodopsin expression. This is achieved through 
induction of the pR8 fate by pR7s through 
Activin and bone morphogenetic protein sig- 
naling, whereas the yR8 fate is specified by 
default (15). R7 and R8 send their axons to the 
medulla, the second neuropil of the optic lobe, 
where they make synapses with some of the 
~40,000 neurons of more than 80 different 
cell types that constitute the medulla (16, 17)
(Fig. 1C). The medulla is retinotopically orga- 
nized in ~750 columns that correspond to the 
~750 ommatidia. 

In contrast to the stochastic specification of 
photoreceptors, medulla neurons are formed 
following a stereotypic mode of development 
[reviewed in (18)]. The medulla develops from 
a neuroepithelium during late third-instar 
larval stage and early pupation. A proneuronal 
wave sequentially converts single rows of 
neuroepithelial cells into neuroblasts, the 
Drosophila neural stem cells, until the 
neuroepithelium is fully consumed (19). The medulla 
is thus sequentially produced, similarly and 
concomitantly to the retina where single rows 
of ommatidia are sequentially added in the 
eye disk at the morphogenetic furrow (20-22). 
Once specified, medulla neuroblasts sequen- 
tially express a series of transcription factors 
that will command the fate of the neurons pro- 
duced during each temporal window (23-25). 
Thus, over time, a single neuroblast is able 
to generate a wide repertoire of different 
neurons, including the entire repertoire of 
unicolumnar neurons that are found in each 
medulla column with a 1:1 stoichiometry with 
photoreceptors (26, 27). 

Connecting photoreceptors with the correct 
postsynaptic partners is fundamental to ensure 
proper color vision. Here, we investigated how 
the stochastic decision made by photoreceptors 
is propagated to the medulla to instruct the 
formation of yellow and pale columns in which 
R7 photoreceptors connect to their proper spe- 
cific targets. We found that correct matching 
is achieved through the generation of super- 
numerary target neurons of each subtype. Neu- 
rons that fail to connect to their corresponding 
R7 photoreceptor are culled by apoptosis. Rec- 
ognition of future synaptic partners is achieved 
using a pair of interacting cell adhesion mol- 
ecules from the Dpr/DIP families expressed 
in R7 or their Dm8 targets (28). 

We argue that competition between Dm8 
subtypes for the available R7s affects both 
their survival and their targeting. This mech- 
anism of elimination of supernumerary neu- 
rons upon lack of interaction between cell 
adhesion molecules might be a general mech- 
anism to ensure the quantitative and quali- 
tative matching of synaptic pairs, and to relay 
the stochastic decisions of sensory neurons to 
deeper brain regions. 


Identification of subtypes of Dm8, the specific 
target neurons of R7 


We first sought to identify the specific target 
neurons of the distinct R7 subtypes and thus 
focused on R7s’ main postsynaptic partner, 
medulla neuron Dm8s (29, 30). The cell 
adhesion molecule Dpr11 is specifically expressed 
in yR7 during pupal development, and one of 
the Dpr11-binding partners, DIPy, is expressed 
in a subset of Dm8s (28) (Fig. 1, D and E). 
Because these molecules have been proposed 
to play a role in establishing synaptic 
specificity in the optic lobe (28, 31), we reasoned 
that the two types of Dm8 neurons that are 
distinguished by DIPy expression could 
correspond to the two R7 subtypes, with DIPy+ 
Dm8s being postsynaptic to Dpr11+ yR7s. 

Fig. 1. Identification of subtypes of Dm8, the 
specific target neurons of R7. (A) Schematic 
representation of the three different subtypes of 
ommatidia. (B) Regulatory network controlling 
R7 and R8 fate specification. (C) Schematic of the 
Drosophila visual system with R7 axons and their 
postsynaptic target Dm8 neurons. (D) Dpr11[MI02231] 
gene-trap expression in retina photoreceptors (Elav) 
at 25 hours APF. Dpr11-GFP is strongly expressed 
in yR7, labeled by Ss (outlined in yellow circles), but 
absent from pR7 (cyan circles). (E) DIPy[MI03222] gene 
trap drives expression of GFP in the adult medulla 
(neuropil labeled by NCad). A subset of Dm8s 
(labeled by CD8::RFP) expresses DIPy (arrowhead; 
DIPy-negative Dm8s, arrow). (F) Dorsoventral 
view of DIPy-expressing Dm8 sparsely labeled with 
myr::GFP extending a single process to the M4 layer 
in its home column (arrowhead). (G) Proximodistal 
view of a DIPy-expressing Dm8 reveals that 
yDm8s always contact a yR7 in their home column 
(yellow circle, 31/31 clones). (H and I) pDm8s 
do not express DIPy and always contact a pR7 
in their home column [arrowhead in (H), cyan circle 
in (I); 33/33 clones]. (J) Tukey box plots 
representing the number of R7s contacted per yDm8s 
and pDm8s outside of their home column, and the 
percentage of these contacts being with yR7s. 
Mean (m) is represented by a cross. **P = 0.0037 
(Student t test); ns, not significant. (K and L) A 
second type of DIPy-negative Dm8 only contacts 
DRA-R7s. Unlike yDm8s and pDm8s, they do not have 
a well-defined home column (arrowhead) and their 
lateral processes do not contact non-DRA-R7s 
(arrow). (M) Proximodistal view of DRA-Dm8s labeled 
by R13E04-Gal4. Note that DRA-Dm8 projections 
are only located at the edges of the dorsal 
half of the medulla where DRA-R7 axons are. 
Photoreceptors are labeled with GMR-RFP and 
yR7 with Rh4-lacZ in (F) to (M). Scale bars, 5 μm 
[(D), (E') to (I), (K), and (L)], 20 μm [(E) and (M)]. 

To test this, we first developed tools to 
genetically label the different populations of Dm8s 
according to their DIPy expression. We took 
advantage of a Minos-mediated integration 
cassette (MiMIC) inserted in the first intron 
of DIPy (32) (fig. S1A) that faithfully 
recapitulates DIPy expression, as confirmed by 
antibody stainings against DIPy (fig. S1B). We 
swapped the green fluorescent protein (GFP) 
within the original MiMIC line with the Gal4 
DNA binding domain to build a DIPy 
split-Gal4 line (DIPy-Gal4DBD) to label 
DIPy-expressing Dm8s, or with Gal80 to generate 
a DIPy-Gal80 to label DIPy-negative Dm8s 
(fig. S1A). The combination of the DIPy 
split-Gal4 line with a hemidriver for the 
histamine chloride channel ort (ort-C1-3-Vp16) 
that is expressed in neurons postsynaptic 
to photoreceptors (29) labeled a large 
subset of Dm8 neurons (fig. S1D). To better 
characterize the DIPy-expressing Dm8s and to 
look at their connectivity, we generated 
flip-out clones to sparsely label this Dm8 
population (Fig. 1, F and G). We confirmed that Dm8 
neurons extend their dendrites in the M6 
layer, where R7 projects, each contacting ~14 
columns (30, 33) (Fig. 1, G and J). At the 
center of their dendritic field, Dm8s extended a 
much more extensive dendritic branch in their 
“home column” along the R7 axon, from the 
M6 to the M4 layer that contained most of 
their synapses with R7 (29, 34) (Fig. 1F). 
Single-cell clonal analysis revealed that 
DIPy-expressing Dm8s always have a Rh4-expressing 
yR7 in their home column (n = 31/31 Dm8s; 
Fig. 1, F and G), but their lateral dendrites 
contact either pR7s or yR7s (Fig. 1, G and J). 
Hereafter, we refer to DIPy-expressing Dm8s 
as yellow Dm8s (yDm8s). 

Fig. 2. Dm8 subtypes are prespecified and have distinct lineages. 

We next characterized DIPy-negative Dm8s 
using two distinct Gal4 lines expressed in Dm8s 
in combination with DIPy-Gal80 (fig. S1F) and 
observed two types of neurons. One 
population of Dm8s was morphologically identical 
to yDm8s (Fig. 1H and fig. S1F). However, 
these neurons always had a pR7 in their home 
column (n = 33/33 Dm8s; Fig. 1I), but, similar 
to yDm8s, they contacted both pR7s and yR7s 
outside their home column (Fig. 1, I and J). We 
refer to these neurons as pale Dm8s (pDm8s). 
Although both pDm8s and yDm8s showed a 
strict preference for the R7 subtype in their 
home column and contacted on average the 
same number of R7s outside their home col- 
umn (Fig. 1J), the ratio of R7 subtypes contacted 
by their lateral dendrites was different: yDm8s 
connected to yR7 versus pR7 with the same 
frequency as the distribution of these photo- 
receptors (Fig. 1J, ratio of yR7 contacted = 
61.4% versus yR7 frequency = 65%), whereas 
pDm8s had a preference for pR7s (Fig. 1J, 
ratio of pR7 contacted = 51.2% versus pR7 
frequency = 35%). Additionally, ~15% of Dm8s 
from both populations harbored two main 
processes and thus had two home columns 
(fig. S1, H and I) that were always both oc- 
cupied by their preferred R7 subtype. 
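
The comparison between lateral contact ratios and photoreceptor frequencies can be made explicit with a short calculation. The Python sketch below divides the fraction of lateral contacts made with the matching R7 subtype by the frequency of that subtype in the retina, using only the values quoted above (Fig. 1J). The `enrichment` helper and the dictionary names are assumptions introduced here for illustration and are not part of the original analysis. A value near 1 would indicate contacts in proportion to availability; a value above 1 would indicate a preference.

```python
# Values quoted in the text (Fig. 1J). The "enrichment" metric itself is an
# illustrative assumption, not a statistic reported in the article.
R7_FREQUENCY = {"yR7": 0.65, "pR7": 0.35}

# For each Dm8 subtype: (matching R7 subtype, fraction of lateral contacts
# made with that subtype outside the home column).
LATERAL_CONTACTS = {"yDm8": ("yR7", 0.614), "pDm8": ("pR7", 0.512)}


def enrichment(observed_fraction, available_fraction):
    """Ratio of observed contact fraction to the subtype's retinal frequency."""
    return observed_fraction / available_fraction


enrichments = {
    dm8: enrichment(fraction, R7_FREQUENCY[r7])
    for dm8, (r7, fraction) in LATERAL_CONTACTS.items()
}

for dm8, value in enrichments.items():
    print(f"{dm8}: contacts with matching R7 subtype are {value:.2f}x availability")
```

Under these assumptions, the ratio for yDm8s stays close to 1 and the ratio for pDm8s is well above 1, in line with the statement that only pDm8s show a preference in their lateral contacts.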

To confirm the strict home-column pairing 
of yDm8s with yR7s and pDm8s with pR7s 
that we observed in single-cell clones, we 
looked at whole-mount stainings of either 
population (fig. S1, E and G). We never ob- 
served a yDm8 extending its main dendritic 
branch along a pR7 (fig. S1J, n = 1046) or the 
reverse for pDm8s (n = 516). We also quan- 
tified the ratio of columns occupied by a Dm8 
as a home column; 88% of yR7 columns were 
occupied by a yDm8, whereas 96.4% of pR7 
columns were occupied by a pDm8 (fig. S1J). 
These numbers might be a lower estimate 
of Dm8s’ column coverage because the Gal4 
lines are not fully penetrant and do not label 
all neurons of a given cell type [see below 
and (35)]. 

In addition to pDm8s, we also identified a 
second type of DIPy-negative Dm8 that only 
innervated DRA photoreceptors (Fig. 1, K and 
L). These Dm8s had a morphology distinct 
from that of pDm8s and yDm8s: They did not 
appear to have any distinctive home column; 
they only made tight contacts with DRA-R7 
termini and did not contact the M6 layer in 
the main part of the medulla innervated by 
pale and yellow R7s (Fig. 1, K and L). We also 
identified a Gal4 line that specifically labeled 
DRA-Dm8s, thereby confirming that these 
neurons were genetically different from pDm8 
neurons and that DRA-Dm8s were confined to 
the outer part of the dorsal half of the medulla 
where DRA-R7 axons are located (Fig. 1M). These 
neurons correspond to the newly identified 
Dm-DRA1 neurons that were shown to be 
postsynaptic to DRA-R7s (36). 

Thus, we identified three types of Dm8s, 
corresponding to the three different R7 sub- 
types. In the main part of the medulla, most 
columns are occupied by the main process of 
a single pDm8 or yDm8 with a perfect pair- 
ing of R7 and Dm8 subtypes. Thus, the topo- 
graphic organization of R7 subtypes in the 
retina is propagated to the medulla and mir- 
rored by the mosaic of Dm8 subtypes. 


Dm8 subtypes are prespecified and have 
distinct lineages 


We next sought to identify the mechanisms 
that lead from the stochastic patterning of 
photoreceptors to a deterministic output in 
the medulla, where most of the columns are 
occupied by a Dm8 with a perfect matching 
between R7 subtypes and their respective 
Dm8 subtypes. 

Two alternative mechanisms could allow 
this matching: (i) R7 subtypes could directly 
coordinate their fate with their Dm8 subtypes 
by instructing naive Dm8s during development 
to adopt the appropriate fate (p versus y). This 
would be similar to the coordination between 
R7 and R8 fates, where pR7s signal to R8s 
within the same ommatidium and instruct 
them to adopt the pR8 fate (15). (ii) Alterna- 
tively, distinct Dm8 subtypes could be speci- 
fied independently of R7 subtypes, such that 
matching would occur during later stages in 
development. 

To distinguish between the two models, we 
sought to identify the origin of the distinct 
Dm8 subtypes and asked whether distinct 
subtypes form during development, before 
R7 innervation, or whether a single naive type 
develops first. Because the Gal4 lines used 
to label Dm8 neurons in adult brains begin 
expression during late pupal development, 
we looked for markers expressed by adult Dm8 
neurons that may also be expressed during 
early development. In adults, all three Dm8 
subtypes expressed the transcription factors 
Dachshund (Dac) and Traffic jam (Tj) (Fig. 2A 
and fig. S2A). We first focused on iden- 
tifying yDm8s during development and asked 
when yDm8s adopt their final subtype fate. 
We looked at the early expression of DIPy in 
late L3 larval optic lobes and identified sev- 
eral distinct clusters of cells expressing DIPy 
(Fig. 2B). One of these clusters also expressed 
Dac and Tj (Fig. 2C) and could represent the 
yDm8 population. The identification of larval 
yDm8s based on the markers expressed in 
adult Dm8s assumes that their expression is 
maintained throughout development. We 
confirmed that this was indeed the case by 
following the Dac+Tj+DIPy+ cell cluster from L3 
until we could identify these neurons as yDm8s 
by their morphology (Fig. 2, D to F). Thus, in 
late L3 stage, when medulla neurons are just 
born, yDm8s have already acquired their fi- 
nal identity and express DIPy. 

We then looked for other Tj and Dac double- 
positive cells in the developing larval optic 
lobe that could represent the other two Dm8 
subtypes. We found four other large clusters 
of Dac+Tj+ neurons (Fig. 2C). Unlike for yDm8s, 
we could not trace pDm8s and DRA-Dm8s from 
larva to adult because of the lack of a marker 
equivalent to DIPy. Thus, to identify which 
cluster corresponded to which Dm8 subtypes, 
we used lineage trace experiments with the 
FLEXAMP memory cassette (fig. S2D) (37). 
When using the tj-Gal4 line, which faithfully 
recapitulates Tj expression in larvae (fig. S2C), 
in combination with DIPy-Gal80, we consistently 
obtained clones of pDm8s and DRA-Dm8s 
(fig. S2E). 

The medulla neuroepithelium is divided 
along the dorsoventral axis into compart- 
ments based on the expression of spatially 
restricted factors (26, 37): dpp, Optix, and 
Vsx1 expression define the three major 
compartments; Optix is in the two arms of the 
medulla neuroepithelium (Fig. 2C) and dpp 
in the two lateral parts. The ventral half can 
also be defined by its early expression of 
hedgehog (hh) (Fig. 2I). We used lineage tools 
to identify the neuroepithelium compart- 
ments from which the different types of Dm8s 
originate. 

We first used a lineage tool for the main 
medulla neuroepithelium compartment using 
Optix-Gal4 (Fig. 2C) and for the central 
medulla neuroepithelium using pxb-Gal4 (Fig. 
2G). In larvae, the majority of Dac+Tj+ 
neurons came from the Optix region; two 
clusters were present in the ventral half, including 
the one expressing DIPy. A third one was in 
the dorsal half (Fig. 2C), whereas a smaller 
cluster was in the pxb region (Fig. 2G). We 
used these lineage tools to trace yDm8s, 
pDm8s, and DRA-Dm8s in adults. All three 
subtypes were labeled by the Optix lineage 
tool (fig. S2, F and G), whereas no Dm8s 
were labeled by the pxb lineage tool (Fig. 2H 
and fig. S2H). We then traced neurons coming 
from the ventral half of the medulla 
neuroepithelium by using hh-Gal4 lineage trace 
(Fig. 2I) to identify whether the two 
Optix-derived Tj+Dac+DIPy− clusters were the pDm8s 
and DRA-Dm8s. pDm8s (and yDm8s) were 
labeled by the hh lineage trace, but none of the 
DRA-Dm8s were (Fig. 2, J and K). Therefore, 
pDm8s come from the ventral Optix cluster, 
next to yDm8s, whereas DRA-Dm8s are part 
of the dorsal Optix-derived cluster (Fig. 2L). 
This shows that the three Dm8 subtypes come 
from three different neural progenitor 
domains and thus have distinct lineages (Fig. 2L). 
The distinct fates of Dm8 subtypes are thus 
preestablished independently of the specifi- 
cation of their presynaptic R7 subtype. 


Change in R7 ratio affects formation of 
Dm8 subtypes 


Because the Dm8 subtypes are specified in- 
dependently of yR7 and pR7, how can the brain 
accommodate stochastic changes in the ratio of 
photoreceptor subtypes to ensure that pDm8s 
always connect to pR7s and yDm8s to yR7s? To 
address this, we used mutations that affect the 
specification of the different ommatidial sub- 
types and looked at the consequences for the 
formation of Dm8s (Fig. 3A). We first focused 
on yDm8s: In the wild type, their arboriza- 
tions covered the M6 layer almost entirely, 
and their main branch that reached M4 in 
their home column could be easily identified 
(Fig. 3B). We examined the effect of the ab- 
sence of yR7s on yDm8s by using a retina- 
specific allele of spineless in which yR7s are 
not specified and the main part of the retina 
is solely composed of pR7s (R. Johnston, per- 
sonal communication). In these ss mutants, 
large areas of the M6 layer were devoid of 
yDm8s because of a decrease in the number 
of yDm8s (Fig. 3, C and G; number of yDm8s 
per optic lobe, WT = 345, ss = 93). Further- 
more, remaining yDm8s lacked a home col- 
umn, as seen by the absence of the typical 
main Dm8 arbor reaching the M4 layer (Fig. 
3C). Similar effects, although varying in mag- 
nitude, could be seen in two other genetic 
backgrounds that lack yR7s: In sevenless (sev) 
mutants, where R7s are not specified (Fig. 3A), 
there was a similar decrease in the number 
of yDm8s (Fig. 3, D and G, sev = 95). How- 
ever, in contrast with ss mutants, some yDm8s 
appeared to still have a home column, al- 
though their main processes were thinner 
and reached higher in the medulla to layer 
M3 where they wrapped around R8 termini 
(Fig. 3D). We also converted the entire retina 
into DRA ommatidia using homothorax gain- 
of-function (IGMR-hth, Fig. 3A) and we ob- 
served a similar, although weaker, decrease 
in the number of yDm8s (Fig. 3G, WT = 345, 
IGMR-hth = 184) and a total absence of yDm8s’ 
home column (Fig. 3E). 

We next performed the reverse experi- 
ment and converted the entire retina into 
yellow ommatidia by overexpressing Ss in 
photoreceptors (IGMR-ss, Fig. 3A). In this 
case, almost every R7 was occupied by a 
yDm8 home column (Fig. 3F) and the num- 
ber of yDm8s increased by 25% (WT = 345, 
IGMR-ss = 430). 

We also quantified the number of pDm8s 
and obtained similar results: In the absence 
of pR7s (IGMR-ss, sev), the number of pDm8s 
dropped substantially (Fig. 3H; WT = 102, 
IGMR-ss = 3.4, sev = 17), whereas it increased 
in ss mutants (Fig. 3H; ss = 139). Taken to- 
gether, these data indicate that the numbers 
of yDm8s and pDm8s are affected by the 
numbers of yR7s and pR7s in the retina. 
In ss gain-of-function (IGMR-ss), most yR7s 
were occupied by a yDm8 home column (Fig. 
3F); the 25% increase in yDm8 number alone 
could not account for such a strong effect 
(Fig. 3G). We speculated that in addition to 
the increase in cell number, an increase in the 
number of home columns covered by indi- 
vidual yDm8 could explain such an effect. We 
therefore generated single-cell flip-out clones 
of yDm8s in ss gain-of-function to look at 
their morphology. In the wild type, 16.1% 
of yDm8s had two home columns (fig. S1, H 
and I), whereas in ss gain-of-function, 45.5% 
of yDm8s had two home columns (Fig. 3, I 
and J). We never observed yDm8s with more 
than two home columns. Thus, two mecha- 
nisms allow adult yDm8s to accommodate 
changes in the number of their R7 presynaptic 
partners: (i) their numbers increase or de- 
crease overall to match the number of yR7s, 
and (ii) when excess yR7s are present, 
individual yDm8s also increase the number of 
columns they occupy, such that most yR7s 
are covered. 
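
The argument that a 25% increase in cell number cannot by itself explain the coverage seen in ss gain-of-function can be checked with a back-of-the-envelope calculation. The Python sketch below uses only the counts and percentages quoted above and assumes, as stated in the text, that every yDm8 has either one or two home columns. The function and variable names are hypothetical, and the result is an estimate rather than a measurement from the article.

```python
# Counts and fractions quoted in the text (Fig. 3, G and J; fig. S1, H and I).
YDM8_COUNT = {"WT": 345, "IGMR-ss": 430}
TWO_HOME_COLUMN_FRACTION = {"WT": 0.161, "IGMR-ss": 0.455}


def home_columns_occupied(cell_count, two_column_fraction):
    """Estimated home columns, assuming each yDm8 has exactly one or two."""
    return cell_count * (1 + two_column_fraction)


occupied = {
    genotype: home_columns_occupied(
        YDM8_COUNT[genotype], TWO_HOME_COLUMN_FRACTION[genotype]
    )
    for genotype in YDM8_COUNT
}

# Relative increase in cell number versus in estimated home columns.
cell_increase = YDM8_COUNT["IGMR-ss"] / YDM8_COUNT["WT"] - 1
column_increase = occupied["IGMR-ss"] / occupied["WT"] - 1

print(f"yDm8 cell number increase: {cell_increase:.1%}")
print(f"Estimated home-column increase: {column_increase:.1%}")
```

Under these assumptions, the estimated number of occupied home columns rises by more than twice the relative increase in cell number, which is consistent with the two mechanisms acting together.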


Apoptosis of excess Dm8s ensures the 
numerical matching of R7s and Dm8s 


Our data indicate that although the different 
types of Dm8s are prespecified, their number 
can be adjusted to accommodate the ratio of 
their presynaptic R7s. One hypothesis is that 
in ss mutants, yDm8s are produced normally, 
but in the absence of their yR7 partners, they 
are eliminated during a later stage of de- 
velopment, whereas when yR7s are in excess 
(IGMR-ss), all yDm8s are maintained. To test 
this, we first looked at the number of yDm8s 
early in development in ss mutants that lack 
yR7s. At 20 hours after puparium formation 
(APF), when the neuroblasts no longer divide 
and thus no more medulla neurons are pro- 
duced, similar numbers of yDm8s were found 
in ss mutants and in the wild type (Fig. 4, A, C, 
and E; WT = 437.5, ss = 416.5). By 40 hours 
APF, however, the number of yDm8s decreased 
to approximately the number observed in the 
adult in both the wild type and ss mutants 
(Fig. 4, A to E; 40 hours APF: WT = 351, ss = 
113; adult: WT = 345, ss = 93). This confirms 
that in the absence of yR7s, yDm8s are still 
produced normally but numerical matching 
with yR7s happens during pupal development. 

Fig. 4. Apoptosis of excess Dm8s ensures the numerical matching of 
R7s and Dm8s. (A to D) DIPy-GFP expression in the optic lobe in WT [(A) and 
(B)] and ss mutant [(C) and (D)] at 20 hours [(A) and (C)] and 40 hours 
APF [(B) and (D)]. yDm8s were labeled by segmenting the Dac staining 
from the GFP staining [(A') to (D'), gray]. Scale bars, 20 μm. (E) Number of 
yDm8s per optic lobe (DIPy-GFP+, Dac+ cells, n = 4 to 6 optic lobes per 
genotype). Data are means ± SD. ****P < 0.0001 (one-way ANOVA, Tukey 
test). (F) Proximodistal view of yDm8s in WT upon cell death inhibition by 
misexpressing P35 in DIPy-expressing neurons. A single yDm8 mispaired with 
a pR7 (cyan circle) is shown by dense GFP staining at the level of the M6 layer 
(F') and its home column at the level of the M5 layer [(F'') and (F''')] 
(mispaired yDm8s = 0.7%, n = 428). Scale bar, 5 μm. 

The decrease in the number of yDm8s in ss 
mutants might be due to the death of yDm8s 
that have failed to find their correct R7 
subtype. Inhibiting apoptosis with tj-Gal4 to 
misexpress the caspase inhibitor P35 in yDm8s 
restored the number of yDm8s in ss mutants 
to the number found at 20 hours APF (Fig. 4E; 
adult: WT = 345, ss = 93, ss + P35 = 406; 
20 hours APF: ss = 416.5). We could also rescue 
the decreased number of yDm8s in the wild type 
by misexpressing P35 (Fig. 4E). The final number 
of adult yDm8s obtained upon cell death 
inhibition was similar in the wild type and 
in ss mutants, and was also similar to the 
number in the ss gain-of-function and in the 
wild type at 20 hours APF (Figs. 3G and 4E; 
WT at 20 hours APF = 437.5, IGMR-ss = 430, 
WT + P35 = 410). This shows that during 
development, a fixed number of yDm8s are produced 
in excess but the relative number of yDm8s 
surviving depends on the number of their 
available presynaptic yR7s: Naturally occurring 
cell death can be rescued by providing more 
yR7s in ss gain-of-function, whereas it can 
be greatly increased by eliminating yR7s in 
ss or sev mutants. However, about 25% of 
yDm8s are still found in the absence of any 
yR7s (Fig. 3G), which suggests that some 
yDm8s that are not connected to yR7s can 
still survive. This could be because they re- 
ceived trophic support from other neuronal 
partners, either from neurons that are normally 
connected to R7s (e.g., their postsynaptic target 
Tm5b) or through ectopic connections that 
they form with other neurons (e.g., pR7). 
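
The survival figures discussed in this section can be recomputed from the cell counts quoted in the text. The Python sketch below expresses each adult yDm8 count as a fraction of the count at 20 hours APF in the same genetic background (Fig. 4E). Treating the 20 hours APF count as the baseline for every condition, including the P35 conditions, is an assumption made here for illustration, and the variable names are hypothetical.

```python
# yDm8 counts per optic lobe quoted in the text (Fig. 4E).
COUNT_AT_20H_APF = {"WT": 437.5, "ss": 416.5}
ADULT_COUNT = {"WT": 345, "ss": 93, "WT + P35": 410, "ss + P35": 406}

# Assumption: each adult condition is compared with the 20 hours APF count
# of its own background (WT or ss mutant).
BASELINE = {"WT": "WT", "ss": "ss", "WT + P35": "WT", "ss + P35": "ss"}

surviving_fraction = {
    condition: ADULT_COUNT[condition] / COUNT_AT_20H_APF[BASELINE[condition]]
    for condition in ADULT_COUNT
}

for condition, fraction in surviving_fraction.items():
    print(f"{condition}: {fraction:.1%} of the 20 hours APF count remain in adults")
```

With these inputs, roughly a fifth of yDm8s persist in ss mutants, close to the "about 25%" stated above, while blocking apoptosis with P35 brings both backgrounds back above 90% of the early count.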

We propose that this mechanism is suffi- 
cient to obtain the perfect matching observed 
in the wild type, and that cell death plays an 
essential role in coordinating the size of the 
Dm8 populations with the ratio of y/p R7 sub- 
types in the retina. 


Physiological apoptosis regulates Dm8 wiring 


We next tested whether the physiological cell 
death might be important for the proper 
wiring of yDm8s by looking for mispairing 
of yDm8s with pR7s when cell death was 
abolished in an otherwise wild-type back- 
ground. We did observe a very low but sig- 
nificant frequency of yDm8s mispaired with 
pR7s (Fig. 4F and fig. S1E; WT + P35 = 0.7%; 
n = 428). This suggests that the great majority 
of undead yDm8s still manage to integrate 
into the proper circuitry. 


yDm8 morphology and survival are affected in 
DIPy and dpr11 mutants 


We next sought to identify the mechanisms 
that control the pairing of Dm8s with their 
specific R7 subtypes and to investigate the 
role of Dpr11 and DIPy in the process. Dprs 
and DIPs are two closely related families of 
immunoglobulin-containing cell adhesion 
molecules (38). Each of the 21 Dprs binds to 
one or several of the 11 DIPs, and these in- 
teractions are required for their neurogenic 
function (39, 40). Because of the striking com- 
plementary expression pattern of Dpr and 
DIP pairs in synaptic partners, these families 
have been proposed to play a role in synaptic 
partner matching (28, 31, 39). Dpr11 and DIPy 
are ideal candidates for the matching of yR7s 
and yDm8s: dpr11 expression is specific to 
yR7s (28) (Fig. 1D) and depends on ss, as 
dpr11 is lost from yR7s in ss mutant retinas 
(Fig. 5A), whereas ss overexpression in 
photoreceptors is sufficient to induce dpr11 
expression in all R7s (Fig. 5B). dpr11 is widely 
expressed in the optic lobe, especially in adult 
brains (fig. S3A). However, at 25 hours APF, 
around the time dpr11 expression peaks in 
yR7s, it is relatively restricted to yR7s in the 
M6 layer while still being broadly expressed in 
other medulla layers (Fig. 5C). By that time, 
yDm8s have already reached the M6 layer and 
have contacted R7s (Fig. 5C) but do not have an 
obvious phenotype in DIPy mutants (Fig. 5D). 

In adults, however, the mutant phenotypes 
for dpr11 or DIPy were quite obvious: yDm8s’ 
number and innervation of the M6 layer were 
decreased (Fig. 5, E and J; WT = 345, DIPy = 
130, dpr11 = 132). This reduction was also 
due to apoptosis during development, as the 
DIPy phenotype could be rescued by 
misexpressing P35 in yDm8s (Fig. 5J; DIPy + P35 = 
432). This phenotype is similar to what was 
reported for mutants for DIPa and its two 
Dpr partners, dpr6 and dpr10, where a 
proportion of the three DIPa-expressing Dm 
neuron types were shown to undergo increased 
apoptosis (39). 

Additionally, yDm8 morphology was 
affected in both mutants. DIPy mutant yDm8s 
failed to extend a proper process in their home 
column (Figs. 3B and 5E) and only had a short 
protrusion at the center of their dendritic field 
(Fig. 5, G and H). In dpr11 mutants, yDm8s 
had a similar but weaker phenotype and ex- 
tended a thin process in their home column 
(Fig. 5, G and I). Rescuing cell death in yDm8s 
was not sufficient to rescue the morphology of 
their dendritic extension in their home col- 
umn (fig. S3B). 

Thus, dpr11 and DIPy mutants phenocopy 
the loss of yR7s (Fig. 3), supporting a model 
that yDm8s in these mutants are unable to 
recognize yR7s and thus do not receive the 
trophic support required for their survival. 
If this was indeed the case, (i) overexpressing 
dpr11 in R7s should compensate for the 
loss of yR7s in ss mutants, and (ii) the loss of 
DIPy should be epistatic over the ss 
gain-of-function. Overexpression of dpr11 in 
photoreceptors increased the number of yDm8s in 
the wild type and could also rescue yDm8 cell 
death and morphology in ss mutants [Fig. 5K 
(IGMR-dpr11 = 400, ss + IGMR-dpr11 = 382) 
and fig. S3D]. Conversely, increasing the 
number of yR7s using ss gain-of-function was not 
sufficient to rescue cell death of yDm8s in a 
DIPy mutant background [Fig. 5K (DIPy = 130, 
DIPy + IGMR-ss = 154) and fig. S3C]. 


DIPy and Dpr11 regulate the pairing between 
yR7s and yDm8s 


These results imply that Dpr11 and DIPy 
mediate the strict matching of yDm8s with yR7s. 
If true, we would expect to observe mispairing 
of the remaining yDm8s with pR7s in either 
mutant because of the inability of yDm8 to 
recognize yR7s. In the wild type, yDm8 home 
columns were always located along yR7 and 
we never observed mispairing with a pR7, 
either in single-cell clones (Fig. 1, E and F) 
or whole mounts (Fig. 6A; number of yDm8s, 
n = 1046). We first tested whether DIPy and 
dpr11 mutants exhibited defects in yDm8 and 
yR7 pairing. We did observe mispairing in 
both mutants (Fig. 6, B and C) where ~5% of 
surviving yDm8s were paired with pR7s (Fig. 
6D; DIPy = 4.7%; dpr11 = 4.8%). The ratio of 
yR7s contacted by DIPy mutant yDm8 lateral 
dendrites was also decreased (fig. S4B; WT = 
61.4%, DIPy = 51.1%) without affecting their 
overall dendritic field size (fig. S4A; WT = 
12.9, DIPy = 12.5). 

We next tested whether DIPy overexpression 
in pDm8s or dpr11 in pR7s was sufficient 
to generate mispairing of Dm8s with R7s. 
When DIPy was sparsely overexpressed in 
pDm8 MARCM clones from late third larval 
instar stage onward using tj-Gal4, pDm8s were 
always mispaired with yR7s (Fig. 6E, n = 12/12). 

We also misexpressed DIPy in two other 
medulla neuron types: Dm11 and Dm12. Dm11s 
project to the M6 layer and have multiple 
processes going along multiple R7s (fig. S2E) 
but do not show a preference for a R7 subtype 
(fig. S4C). DIPy overexpression in Dm11s was 
sufficient to induce these processes to 
exclusively occupy yR7 columns (fig. S4D). 
However, misexpression of DIPy had no effect on 
Dm12 neurons (fig. S4, E and F). 

We also overexpressed dpr11 in all 
photoreceptors and looked at whether this was 
sufficient to create mispairing of yDm8s 
with pR7s. Indeed, 20% of yDm8s extended 
their home column in pR7 columns (Fig. 6F, 
n = 5/23). Taken together, these data show 
that Dpr11-DIPy interaction is sufficient to 
promote pairing of Dm8s with yR7s, whereas 
lack of Dpr11 and DIPy only causes 5% of 
yDm8s to mispair with pR7s. 

Two hypotheses could explain the 
discrepancy between the requirement and the 
sufficiency of dpr11 and DIPy: (i) In the absence of 
DIPy, pDm8s are unaffected and thus target 
pR7s, leaving no space for mutant yDm8s to 
target pR7s. One could test this by looking at 
yDm8 pairing in a DIPy mutant where pDm8s 
were ablated. Because this experiment was not 
technically feasible, instead of removing the 
competition with pDm8s, we allowed DIPy 
mutant yDm8s to compete with both pDm8s 
and wild-type yDm8s. We looked at the 
pairing of DIPy homozygous mutant yDm8s in a 
mosaic animal using MARCM where most 
yDm8s were heterozygous for DIPy. Because 
DIPy mutant yDm8s would normally undergo 
apoptosis (see below and fig. S6), we rescued 
cell death by misexpressing P35 in the mutant 
clones. In these conditions, half of the DIPy 
mutant yDm8s mispaired with pR7s (Fig. 6H, 
n = 7/14), whereas in single-cell flip-out clones 
in whole DIPy mutants, we did not observe a 
single mispaired yDm8 (Fig. 6G; n = 0/15). 
Thus, in this experimental setup where pR7 
and yR7 columns are equally accessible, yDm8s 
evenly distribute between the two, which 
suggests that DIPy is required for yDm8s to pair 
with the proper R7 subtype. (ii) A second 
explanation could be that because two-thirds of 
yDm8s die during early pupal development in 
DIPy mutants after they are unable to find a 
yR7, they would be able to mistarget if cell 
death was rescued. Indeed, when we rescued 
cell death in DIPy mutants, there was a factor 
of 3 increase in yDm8 mispairing (Fig. 6D; 
Fig. 5. yDm8 morphology and survival are affected in DIPy and dpr11
mutants. (A and B) dpr11-GFP expression in retinas at 25 hours APF in ss
mutant (A) or ss gain-of-function (B). Prospero labels R7s; Elav labels
photoreceptors. Large ellipse indicates a single ommatidium and smaller ellipse
denotes the R7 cell. Note that dpr11-GFP expression is not lost in a single
outer photoreceptor in ss mutant, whereas weak dpr11-GFP is also seen in a
few outer photoreceptors in the ss gain-of-function. (C) dpr11-GFP expression
in medulla at 25 hours APF. The split-Gal4 line DIPy-Gal4DBD∩OK371-Vp16
drives CD8::RFP in yDm8s. In the M6 layer, dpr11-GFP is mainly seen in yR7s


DIPy = 4.7%, DIPy + P35 = 14.7%). However, 
because not every yDm8 that was prevented 
from dying mispaired with a pR7, this suggests 
that cell death is not the result of the culling of 
mispaired yDm8s; rather, the implication is 
that preventing cell death makes them more
competent to compete with pDm8s to occupy 
pR7 columns. 
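
The "factor of 3" quoted above can be checked directly from the two mispairing rates reported for Fig. 6D. The following is a minimal sketch of that arithmetic; the dictionary keys and the helper name `fold_change` are illustrative choices, not part of the study.

```python
# Mispairing rates reported for Fig. 6D (percent of yDm8s paired with pR7s).
mispaired_pct = {"DIPy": 4.7, "DIPy + P35": 14.7}


def fold_change(before: float, after: float) -> float:
    """Ratio of the rate after rescuing cell death to the rate before."""
    return after / before


fc = fold_change(mispaired_pct["DIPy"], mispaired_pct["DIPy + P35"])
print(f"fold change in mispairing: {fc:.2f}")
```

The ratio rounds to 3, consistent with the text. It says nothing about which cells mispair, only how much the overall rate changes.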

Taken together, our data indicate that Dpr11
and DIPy mediate the pairing of yDm8s with 
axons (arrows). (D) DIPy mutant yDm8s labeled with CD8::GFP at 25 hours
APF. (E and F) Dorsoventral view of yDm8s in DIPy (E) and dpr11 (F) mutants.
Arrowheads indicate morphological defects in yDm8 home columns.
(G to I) Sparsely labeled yDm8s in WT (G), DIPy mutant (H), and dpr11
mutant (I); arrow and arrowheads indicate the WT or defective Dm8 process
in the home column. (J and K) Number of yDm8s per optic lobe
(DIPy-GFP+ Dac+ cells, n = 4 to 7 optic lobes per genotype). Data are
means ± SD. ***P < 0.001, ****P < 0.0001 (one-way ANOVA, Tukey test).
Scale bars, 5 μm [(A), (B), (C'), (D), and (G) to (I)], 20 μm [(C), (E), and (F)].


their presynaptic partner yR7s. It is noteworthy 
that we did not find any other DIPs expressed 
in pDm8s or pR7s, whereas Dm8s express mul- 
tiple Dprs based on RNA sequencing data (17). 
This suggests that the matching between pR7s
and pDm8s uses cell adhesion molecules dis-
tinct from Dprs and DIPs.


Targeting of the M6 layer by yDm8s is not 
affected in DIPy mutants and in DIP 
gain-of-function 

On the basis of their layer-specific expression 
in the medulla and misexpression experiments, 
it was proposed that Dprs and DIPs regulate 
layer targeting (39). In either dpr11 or DIPy
mutants, yR7s or yDm8s do not mistarget but 
instead elaborate processes in the appropriate 
M6 layer (Fig. 5, E and F). Because the mis- 
targeted cells could have been those elimi- 
nated by apoptosis, we looked at yDm8s in 
DIPy mutants where cell death was abolished, 
as well as during development when apoptosis 
happens. We did not observe mistargeting to 
another layer in either case (Fig. 5D and fig. 
S3B). To test whether DIP overexpression was
sufficient to mistarget neurons to an improper
layer, we misexpressed DIPs in different neu-
ronal populations. As described above, over-
expression of DIPy in Dm12s that normally
innervate the M3 layer was not sufficient to
make them target the M6 layer or other layers
where DIPy is expressed (fig. S4, E and F). We
also tested whether replacing DIPy by other 
DIPs normally expressed in different layers 
would be sufficient to retarget yDm8s to these 




Fig. 6. DIPy/Dpr11 regulates the pairing between yR7s and yDm8s. (A to
C) Proximodistal view of yDm8s labeled with CD8::GFP in WT (A), DIPy mutant (B),
and dpr11 mutant (C), either at the level of the M6 layer [(A) to (C) and (A') to
(C')] or M5 layer [(A") to (C")]. Some yR7 or pR7 columns are indicated by yellow
and cyan circles, respectively. In WT, yDm8s occupy most yR7 columns but never
occupy pR7 columns (A"). In DIPy and dpr11 mutants, many yR7 columns are
devoid of yDm8s, and some yDm8s contact pR7s (cyan circle). (D) Quantification
of yDm8s mispairing with pR7s (number of yDm8s: WT, n = 1046; DIPy, n = 223;
layers (fig. S5). Overexpression of DIPδ in DIPy
mutant yDm8s had no effect (fig. S5E), but
overexpression of DIPa led some yDm8s to
send small processes to the M3 and M7 layers
(fig. S5C), where DIPa and its two ligands are
expressed (fig. S5, B, F, and G). Thus, the tar-
geting of yDm8s to the M6 layer is indepen-
dent of Dpr11-DIPy interaction, and yDm8s
cannot be efficiently retargeted to different
layers by ectopic expression of other DIPs.
However, the extension of their home column
requires DIPy, and ectopic expression of DIPa
leads to the formation of small extensions
reminiscent of Dm8s’ home column. Taken 
together, these results suggest that DIPs do 
not play an important role in layer targeting 
in the visual system but instead are involved 
in the matching of synaptic pairs. 


Competitive interactions between Dm8s 
regulate survival and wiring 


The mispairing of yDm8s was much enhanced 
when they had to compete for targeting with 
wild-type yDm8s (see above). We thus asked 
whether survival was also affected by compe- 
tition among Dm8s. We generated DIPy yDm8 
mutant MARCM clones in either a DIPy het-
erozygous background or in a DIPy mutant
background where yDm8s both outside and
within the clone had identical genotypes (fig.
S6A). If competition among yDm8s played a
role in survival, we would expect to see differ-
ences in clone size. In the heterozygous back-
ground, we obtained small clones of yDm8s
(fig. S6B; yDm8s per clone: 2 ± 2.6, mean ±
SD) whereas in DIPy mutant background, we
obtained clones of significantly larger size
(fig. S6B; yDm8s per clone: 10.8 ± 3.5). Because
these experiments rely on generating clones
of the same size, we controlled for clone size
by quantifying the number of pDm8s per clone.
In both conditions, we obtained similar num-
bers of pDm8s (fig. S6B; pDm8s per clone: WT
background, 29.6 ± 10.3; DIPy background,
28.6 ± 6.3), thereby confirming that the dif-
ference in yDm8 number comes from a dif-
ference in survival. Thus, DIPy mutant yDm8s
are much more likely to survive when they
compete with DIPy mutant yDm8s rather
than with wild-type yDm8s.
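
The clone-size control described above amounts to comparing yDm8 counts after normalizing to the pDm8 count of the same clones. A minimal sketch of that comparison follows, using only the means reported for fig. S6B. The variable names are illustrative, and a ratio of means is only an approximation of a per-clone ratio, which would need the raw counts.

```python
# Mean cells per clone reported for fig. S6B (SDs omitted in this sketch).
clones = {
    "heterozygous background": {"yDm8": 2.0, "pDm8": 29.6},
    "DIPy mutant background": {"yDm8": 10.8, "pDm8": 28.6},
}


def ydm8_per_pdm8(clone: dict) -> float:
    """yDm8 count normalized to the pDm8 count, a proxy for clone size."""
    return clone["yDm8"] / clone["pDm8"]


het = clones["heterozygous background"]
mut = clones["DIPy mutant background"]

# Raw difference in surviving mutant yDm8s between the two backgrounds.
raw_fold = mut["yDm8"] / het["yDm8"]
# Same comparison after correcting for clone size via pDm8 numbers.
normalized_fold = ydm8_per_pdm8(mut) / ydm8_per_pdm8(het)

print(f"raw fold difference: {raw_fold:.1f}")
print(f"clone-size-normalized fold difference: {normalized_fold:.1f}")
```

Because the pDm8 means are nearly identical in the two backgrounds, normalization barely changes the fold difference, which is the point of the control.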

We noticed that affecting yDm8s sometimes 
yielded unexpected effects on pDm8s that 
could also be explained by competitive inter-
actions among Dm8s: In DIPy and dpr11 mu-
tants, the number of pDm8s increased (fig.
S6D; WT = 102, DIPy = 135, dpr11 = 136). Be-
cause neither DIPy nor dpr11 is expressed in
pDm8s, the increase in pDm8 number might 
result from the decrease in the number of 
yDm8s. We thus explored the nonautonomous 
dpr11, n = 478; DIPy;DIPy-Gal4>P35, n = 251; WT;DIPy-Gal4>P35, n = 428).
(E) DIPy overexpression in pDm8 MARCM clones is sufficient for mispairing of
pDm8s with yR7s (n = 12/12). (F) yDm8 flip-out clone with dpr11 overexpressed in
all photoreceptors. Some yDm8s are mispaired with pR7s (n = 5/23). (G) yDm8
flip-out clone in DIPy background; 100% of the yDm8 clones obtained are paired
with yR7s (n = 15/15). (H) DIPy mutant MARCM clone of a yDm8 expressing
myr::GFP and P35 in an otherwise heterozygous background; 50% of the yDm8
clones obtained are mispaired with pR7s (n = 7/14). All scale bars, 5 μm.
effects on pDm8 survival. We first asked
whether promoting survival of yDm8s would
affect pDm8 survival when we increased the
number of yDm8s without affecting pR7
specification: In dpr11 gain-of-function (lGMR-
dpr11), the number of yDm8s increased, with
a corresponding >50% decrease in the num-
ber of pDm8s (fig. S6D; WT = 102, lGMR-
dpr11 = 41). We obtained the same effect in
the absence of any yR7s when dpr11 was over-
expressed in a ss mutant (fig. S6D; ss = 139,
ss + lGMR-dpr11 = 32). We also asked whether
decreasing the number of yDm8s (by mutat-
ing DIPy) in a ss gain-of-function would have
an effect on the survival of pDm8s (fig. S6D),
almost all of which normally die (Fig. 3G).
The decreased number of yDm8s was accom-
panied by an increase of pDm8s (fig. S6D;
lGMR-ss = 3.4, DIPy + lGMR-ss = 29). Taken
together, these results show that the size of the 
pDm8 population is affected by the number of 
yDm8s. Dm8s must compete for targeting, 
and affecting the size of one population affects 
the survival of the other. This competition 
might be regulated by self-avoidance mech- 
anisms (41) and would explain why there is 
never more than one Dm8 home column 
per R7. 
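
The size of these nonautonomous effects is easier to compare as percent changes. The sketch below recomputes them from the pDm8 numbers quoted for fig. S6D. The helper `percent_change` and the data layout are illustrative assumptions, and the input values are the reported means only.

```python
def percent_change(reference: float, observed: float) -> float:
    """Signed percent change of `observed` relative to `reference`."""
    return 100.0 * (observed - reference) / reference


# (reference genotype, pDm8 number, test genotype, pDm8 number), from fig. S6D.
pdm8_counts = [
    ("WT", 102, "lGMR-dpr11", 41),
    ("ss", 139, "ss + lGMR-dpr11", 32),
    ("lGMR-ss", 3.4, "DIPy + lGMR-ss", 29),
]

changes = {}
for ref_name, ref, test_name, test in pdm8_counts:
    changes[test_name] = percent_change(ref, test)
    print(f"{test_name} vs {ref_name}: {changes[test_name]:+.1f}% pDm8s")
```

The first comparison gives a decrease of about 60%, in line with the ">50% decrease" stated in the text. The other two comparisons change in the directions described.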


Discussion 


In Drosophila, patterning of the mosaic of 
ommatidial subtypes is established by sequen- 
tial steps and is initiated in R7 photoreceptors 
by a single transcription factor, Ss. This initial 
decision is then transmitted to the R8 of the 
same ommatidium so that R7 and R8 have 
paired rhodopsin expression (15). In contrast, 
our results show that the mechanism respon- 
sible for the coordination of R7 subtype speci- 
fication with their main postsynaptic target 
in the brain is different. Dm8 subtype specifi- 
cation does not depend on direct induction 
from R7s, and each Dm8 subtype is produced 
independently from its corresponding R7s.
We found that the matching occurs during a 
later stage of circuit formation, after the spec- 
ification of the different components of the 
circuit. After matching with their R7 subtypes, 
controlled by Dpr11/DIPy for yR7s/yDm8s, 
supernumerary Dm8s are culled by apoptosis. 
Because the ratio of ommatidial subtypes can 
vary among individuals (42), this mechanism 
allows the perfect matching that is observed 
in adults, where most R7s, if not all, are inner- 
vated by a single Dm8 of the proper type. This 
developmental plasticity provides a powerful 
mechanism to transmit the stochastic forma- 
tion of the photoreceptor mosaic to the deter- 
ministic patterning of the brain. 


Role of DIPy and Dpr11 in synaptic
partner pairing 


The complementary expression of Dpr and 
DIP binding pairs in synaptic partners raised 
the possibility that these proteins might be
the long-sought “Sperry molecules” that act 
as molecular tags to instruct synaptic spec- 
ificity (28, 31, 43). Here, we provide evidence 
that DIPy in yDm8 and Dpr11 in yR7 instruct 
synaptic partner matching. Similar to the ge- 
netic removal of yR7s, loss of either DIPy or 
Dpr11 leads to apoptosis of yDm8s; this sug-
gests that in their absence, yDm8s are unable
to connect to yR7s and to receive from yR7s 
the trophic support required for their sur- 
vival. In both mutants, this is accompanied 
by limited mispairing of yDm8s with pR7s, 
whereas ectopic expression of these mole- 
cules is sufficient to create mispairing between 
R7 and Dm8 subtypes. However, targeting to
the proper M6 layer of the medulla is not 
affected in these mutants, nor is the dendritic 
size of yDm8s. Thus, we propose that Dprs 
and DIPs act during a later step of circuit 
formation to allow distinct neurons that pro- 
ject to the same layer to recognize their ap- 
propriate synaptic partners. 

Analysis of other Dpr/DIP pairs in and out- 
side the visual system supports this view: In 
the medulla, loss of either DIPa or its two 
ligands, Dpr6 and Dpr10, results in apoptosis
of a proportion of three Dm neuron types that 
express DIPa, likely because these neurons 
are unable to recognize their targets (39). In 
the olfactory system, loss of DIPs leads to the 
disorganization of olfactory glomeruli (44). At 
the larval and adult neuromuscular junction, 
DIPa is expressed in a subset of motoneurons, 
whereas its binding partner Dpr10 is in mus- 
cles (45, 46). Loss of either leads to the partial 
loss of the innervation of the muscle by moto- 
neurons. During development, mutant adult 
motoneurons extend normal filopodia that 
target the proper muscles. However, these 
filopodia fail to be maintained, likely because 
they are unable to recognize the proper mus- 
cles in the absence of these molecules (45). 
In the lamina, loss or misexpression of DIPs 
leads to ectopic synapse formation (47). 

Taken together, this work suggests that Dprs 
and DIPs establish synaptic specificity. The 
difference in phenotypes in different systems 
(e.g., survival, mistargeting, or loss of axonal
branches) might reveal the different require- 
ments for such molecules in distinct circuits. 


Apoptosis as a mechanism for numerical 
matching of neuronal pairs 


Programmed cell death facilitates quantitative 
matching of synaptic partners (48). For exam- 
ple, target-derived nerve growth factor (NGF) 
promotes survival of sympathetic and sensory 
neurons (49, 50). This discovery led to the de- 
velopment of the neurotrophic theory, which 
states that neurons are produced in excess and 
that competition for limited trophic support 
allows for the numerical matching of afferents 
with their targets in the periphery. Here, we 
provide evidence that a similar phenomenon
happens in the Drosophila central nervous 
system. We show that yDm8s are produced in 
excess, and that around 25% are eliminated 
by apoptosis during normal development. This 
cell death could be fully rescued by increasing 
the number of yDm8 afferents, or conversely be 
aggravated by decreasing the number of yR7s. 
Because DIPy and dpr11 mutants phenocopy
the loss of yR7, uncovering a link between 
synaptic pairs matching and survival, this 
argues that the numerical matching of pairs 
of R7 and Dm8 is obtained by apoptosis of 
unmatched Dm8s. 

In other parts of the visual system and of 
the brain, neuronal survival is not affected in 
DIP mutants (44-47). Thus, cell death might 
reveal circuit-specific properties of the forma- 
tion of the visual system. For yDm8s, the de- 
pendency on targeting for survival allows the 
1:1 matching of yDm8s and yR7s. For other
multicolumnar neurons, one can only specu-
late about the function of normally occurring
cell death. The medulla is composed of ~750
retinotopically organized columns that can be 
considered as repetitive microcircuits that each 
compute information from a single point of the 
visual field. Each column needs to be inner- 
vated by the proper neurons in the proper 
amount. Thus, creating more multicolumnar 
neurons that require target-derived trophic 
support to survive might allow the complete 
innervation of the medulla while having the 
optimal number of neurons. 


Materials and methods 
Drosophila strains 


Flies were kept on standard cornmeal medium 
at 25°C under 12-hour light/dark cycles (except when
otherwise specified). 

Dprit’°??7"(44.0181), DIP °?"?(435928), 
GMR24F06-Gal4(#49087), GMR24F06-LexA 
(#52695), GMR13E04-Gal4(#48565), GMR13E04-
LexA(#52457), OK371-Gal4(#26160), tj-Gal4'* 
(K#104.055), Optix-Gal#’?"?"(K#104.266), hh- 
Gal4(#67046), pap °°>5(#37891), DIPo:: 
GFP™"©?031(460523), DIPS::GFP§5(#60558), 
Dpr6::GFp 5459287), Dpr0::GFPM 0" 
(#59807), Dpr12::GFP""(460171). 

R57C10-FLPG5::PEST(#64089), 10XUAS-FRT-
stop-FRT-myr::smGdP-FLAG(#62130), 10XUAS-
FRT-stop-FRT-myr::smGdP-HA, 10XUAS-FRT-
stop-FRT-myr::smGdP-V5-THS-10XUAS-FRT-
stop-FRT-myr::smGdP-FLAG(#64085), GMR-
myr::RFP(#7120, #7122, #7123), Rh4-lacZ(#8480,
#8481), UAS-nls::βGal(#3955), LexAop-FRT-
stop-FRT-CD8::GFP(#57588), UAS-P35(#6298,
#5072), 10XUAS-myr::GFP(#32198), 10XUAS-
CD8::RFP(#32219), Act-FRT-stop-FRT-nls::βGal
(#6355), 13xLexAop2-CD8::GFP(#32205,
#32203), 13xLexAop2-myr::GFP(#32209),
13xLexAop2-6xmCherry(#52272), R11C05-
LexA(#54608), R47G08-Gal4(#50328), FRT82b,
Tub-Gal80(#5135), Tub-Gal80ts(#7108), FRT19A
(#1709), hs-Flp, FRT19A, Tub-Gal80(#5132),
20XUAS-FLPG5::PEST(#55807), 13xLexAop2- 
FRT-stop-FRT-myr::smGdP-V5(#62107), Df 
(3R)Exel7330(#7985), sev'*(#10546), Act-FRT- 
stop-FRT-LexA. 

OK371-Vp16, OrtC2b-Gal4, OrtC1-3-Vp16 from
Chi-Hon Lee, ss““’ from R. Johnston, dprir”” 
and DIPy" from L. Zipursky (39), DIPy- 
Gal4“'°*?2? from H. Bellen (28). 

From this study: lGMR-ss, lGMR-hth, lGMR-
dpr11, LexAop2-FRT-stop-FRT-myr::RFP, pxb-
T2A.Gal4""5, 10XUAS-DIPa, 10XUAS-DIPδ,
10XUAS-DIPy, DIPy-Gal4DBD and DIPy-Gal80.

The detailed genotype for each figure is 
given in table S1. 


Immunohistochemistry 


Fly brains were dissected in ice-cold PBS and 
fixed for 3 hours in 4% formaldehyde (v/w) in 
1x PBS at 4°C. After a 30-min wash in PBST 
(1x PBS + 0.4% Triton X-100 and 0.5% goat
serum), brains were incubated for 2 days in 
primary antibodies diluted in PBST, followed 
by 2 days with secondary antibodies diluted 
in PBST. After washes, brains were mounted 
in Slowfade and imaged on either a Leica SP5 
or SP8 confocal. The antibodies used are listed 
in table S2. 
CELL BIOLOGY


Light-regulated collective contractility in a 
multicellular choanoflagellate 


Thibaut Brunet, Ben T. Larson, Tess A. Linden, Mark J. A. Vermeij,
Kent McDonald, Nicole King


Collective cell contractions that generate global tissue deformations are a signature feature of animal 
movement and morphogenesis. However, the origin of collective contractility in animals remains unclear. 
While surveying the Caribbean island of Curaçao for choanoflagellates, the closest living relatives of
animals, we isolated a previously undescribed species (here named Choanoeca flexa sp. nov.) that forms 
multicellular cup-shaped colonies. The colonies rapidly invert their curvature in response to changing 
light levels, which they detect through a rhodopsin-cyclic guanosine monophosphate pathway. Inversion 
requires actomyosin-mediated apical contractility and allows alternation between feeding and swimming 
behavior. C. flexa thus rapidly converts sensory inputs directly into multicellular contractions. These 
findings may inform reconstructions of hypothesized animal ancestors that existed before the evolution
of specialized sensory and contractile cells.


The evolution of animals from their single-
celled ancestors involved several major
evolutionary innovations, including mul-
ticellularity, spatial cell differentiation,
and morphogenesis (1, 2). Efforts to re-
construct the origin of animal multicellularity
have benefited from the study of choanofla- 
gellates, the closest living relatives of animals 
(3-5). Choanoflagellates are microbial eukary- 
otes that feed on bacteria and live in aquatic 
environments around the world; many species 
differentiate over their life history into diverse 
cell types, including unicellular and multicel- 
lular forms (3, 6-8). Comparative genomics 
has revealed many gene families once thought 
to be unique to animals (such as cadherins, 
C-type lectins, and receptor tyrosine kinases) 
in choanoflagellates (4, 9-12). Moreover, the 
model choanoflagellate Salpingoeca rosetta 
(6) exhibits diverse responses to environ- 
mental cues, including pH-taxis (13), aerotaxis 
(14), and bacterial regulation of multicellular 
development (15) and mating (16). However,
S. rosetta is only one of ~380 known species 
(17), and choanoflagellates are at least as gene- 
tically diverse as animals (9). Choanoflagellate 
diversity thus represents a largely untapped 
opportunity to investigate environmental reg- 
ulation of cell behavior, the principles that 
broadly underpin multicellularity, and the evo- 
lution of animal cell biology. We report here on 
the discovery of light-regulated collective cell
contractility in a recently isolated multicel- 
lular choanoflagellate. This finding reveals that 
apical cell contractility evolved before the orig- 
in of animal multicellularity. 


Photic cues induce sheet inversion in a 
colonial choanoflagellate 


During a survey of choanoflagellate diversity 
on the Caribbean island of Curaçao (Fig. 1, A
and B), we collected large, cup-shaped colonies
of protozoa (~100 μm in diameter) from shal-
low splash pools above the tide line of a rocky
coastal area (Fig. 1B). Each colony was com- 
posed of a monolayer (“sheet”) of up to hun- 
dreds of flagellated cells (Fig. 1C and movie S1). 
The cells bore the characteristic collar complex 
of choanoflagellates (1, 3), in which a “collar” 
of microvilli surrounds a single apical flagel- 
lum (Fig. 1D). However, unlike in most choa- 
noflagellate colonies (3), the apical flagella 
pointed into the interior of the colony (Fig. 
1E), resembling the orientation of collar cells 
in the choanocyte chambers of sponges (8). 
The colonies inverted rapidly (within ~30 s 
from initiation to completion) while maintain- 
ing their cell topology, such that the flagella 
pointed outward along the radius of curvature 
of the colony (Fig. 1, F and G, and movie S2). 
The colonies tended to remain in the inverted 
form (“flagella-out”) for several minutes before 
reverting to their initial, relaxed conformation 
(“flagella-in”) in a similarly rapid process (Fig. 
1H and movie S3). 

To start laboratory cultures of this choano- 
flagellate, we manually isolated representative 
colonies away from the other microbial eukary- 
otes in the splash pool sample (for example, 
presumptive Oxyrrhis sp. dinoflagellates) (Fig. 
1C and movie S1) and transferred them into 
nutrient-supplemented artificial seawater
along with co-isolated environmental bacteria 
(which choanoflagellates need as a food source). 
Analysis of the 18S ribosomal DNA (rDNA) 
sequence indicated that the choanoflagellate 
is the sister species of the previously de- 
scribed Choanoeca perplexa (also known as 
Proterospongia choanojuncta) (Fig. 1J and fig. 
S1), whose life history includes both single 
cells and colonies (7, 19). Inspired by the sheet-
bending behavior of the Curaçao choano-
flagellate, we named it Choanoeca flexa. [A
similar behavior was mentioned in a 1983 study 
of C. perplexa (7), but the culture subsequently 
stopped forming colonies, preventing further 
study (3).] C. flexa sheet inversion, represent-
ing a global change in multicellular form, is
reminiscent of concerted movement and mor- 
phogenesis in animals (such as muscle contrac- 
tion or gastrulation). Because of the potential 
evolutionary implications of rapid shape change 
in C. flexa, we investigated (i) how colony inver- 
sion is regulated, (ii) the mechanisms underly- 
ing colony inversion, and (iii) the ecological 
consequences of colony inversion. 

Several lines of evidence indicated that light 
levels regulate C. flexa colony inversion. When
we imaged C. flexa sheets for >1 hour under
constant illumination, colony inversions be-
came less frequent. By contrast, after the mi-
croscope illumination was turned off, the
colonies inverted almost immediately (movie 
S4). To quantify light-to-dark-induced C. flexa 
inversion, we established an assay based on the 
observation that the projected area of a C. flexa 
sheet decreases by as much as 50% during in- 
version (Fig. 2, A to C, and movie S5). Using this 
assay, we confirmed that a rapid decrease in 
illumination reliably induces inversion of C. flexa
colonies within 30 s (Fig. 2D and movie S6). 
Choanoflagellates had previously been thought 
to be insensitive to light (20). 


A rhodopsin-cGMP pathway regulates 
colony inversion in response to 
light-to-dark transitions 


We next investigated how C. flexa colonies 
sense and respond to changing light levels. 
Although choanoflagellates are unpigmented 
and transparent, at least four species en- 
code a choanoflagellate-specific rhodopsin- 
phosphodiesterase fusion protein (fig. S2) 
(9), RhoPDE, that has been investigated as a 
potential optogenetic tool (21-24). RhoPDEs 
consist of an N-terminal type I rhodopsin 
[a photosensitive transmembrane protein 
broadly involved in light detection (25)] fused 
to a C-terminal phosphodiesterase (PDE) that 
catalyzes cyclic nucleotide hydrolysis (Fig. 3A). 
In in vitro studies (21-24), S. rosetta RhoPDE
appears capable of converting a photic stim-
ulus into a biochemical signal within seconds,
similar to the time scale of the C. flexa response
to light-to-dark transitions. 
To search for RhoPDE or other candidate
photosensitive proteins in C. flexa, we se-
quenced and assembled the C. flexa tran-
scriptome (26), which we found to encode
four RhoPDE homologs (fig. S2) (GenBank
accession numbers MN013138, MN013139,
MN013140, and MN013141). No other rhodop-
sins were detected in the C. flexa transcrip-
tome. The only other candidate photoreceptor
protein domain found was a cryptochrome
transcription factor (27), which is predicted to 
act on the time scale of transcriptional regu- 
lation [at least several minutes (28, 29)] and, 
therefore, is unlikely to mediate the light-to- 
dark transition response. Thus, we focused our 
attention on the RhoPDEs. 

If RhoPDE regulates the light-to-dark tran-
sition response, depletion of the rhodopsin
chromophore, retinal (30, 31), should abolish
rhodopsin activity and thereby prevent the re- 
sponse. Moreover, artificially increasing the 
cellular concentration of cyclic guanosine 
monophosphate (cGMP) or cyclic adenosine 
monophosphate (cAMP) (which are degraded 
by the enzymatic activity of PDEs) should 
mimic the effect of darkness and be suffi- 
cient to trigger sheet inversion. 


Fig. 1. Multicellular sheets of a colonial choanoflagellate from Curaçao rapidly and
reversibly invert in curvature. (A to C) C. flexa was isolated from splash pools on the
northern shore of Curaçao. (A) Map of the Caribbean Sea. Arrowhead: Curaçao. (Inset) Map
of Curaçao. Inset arrowhead: sampling site (12°13'38.9" N 69°00'47.0" W). (B) Photograph
of the sampling site. Arrow: splash pool. (C) Microscopy of freshly collected splash pool
sample revealed a diverse microeukaryotic community, including dinoflagellates
(Oxyrrhis sp.; arrowhead) and cup-shaped choanoflagellate colonies (C. flexa; arrows) that
rapidly inverted their curvature. Still frame from movie S1. (D) Diagnostic features of
choanoflagellate cell morphology shown by differential interference contrast (DIC)
micrograph and sketch of a C. flexa cell. (E and F) C. flexa colonies alternate between
two conformations, flagella-in (E) and flagella-out (F). (G and H) C. flexa colonies
rapidly and reversibly invert their curvature while maintaining contacts among
neighboring cells. (G) Flagella-in colony inverts to the flagella-out orientation
(movie S2). (H) Flagella-out colony reverts to the flagella-in orientation (movie S3).
Movie frames were rotated to facilitate tracking individual cells between images.
(I) Summary of the inversion and relaxation processes. (J) Phylogenetic analysis of
18S rDNA sequences revealed that C. flexa (bold) is a sister to the choanoflagellate
C. perplexa (19). (Scaled branch lengths are in fig. S1.)
Fig. 2. Light-to-dark transitions induce C. flexa colony inversion. (A to C) Colony inversion correlates
with a decrease in the projected area of the colony. (A and B) C. flexa colony spontaneously inverts from 
the flagella-in (A) to the flagella-out (B) orientation (movie S5). (Insets) Pseudocoloring highlights the 
orientations of the cells in the boxed regions. Cell orientation relative to colony curvature inverts without 
breaking contacts with neighboring cells. (C) The projected area of the colony from (A) and (B) decreased 
by ~50% during inversion. Sheet area was normalized to its preinversion projected area. (D) Colonies reliably 
undergo inversion in response to light-to-dark transitions. Normalized projected area of n = 5 colonies 
over time before and after light reduction (vertical dashed line). Line: mean projected area (rolling average
over 5-s windows); ribbon: standard deviation.
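
The normalization and smoothing described in this legend can be sketched as follows. This is only an illustration under stated assumptions: the trace values are hypothetical, the sampling rate is assumed to be one frame per second (so a 5-s window spans five samples), and a trailing window is used because the legend does not say whether the window is trailing or centered.

```python
from statistics import mean


def normalize_area(areas, n_baseline):
    """Divide each value by the mean of the first n_baseline (preinversion) samples."""
    baseline = mean(areas[:n_baseline])
    return [a / baseline for a in areas]


def rolling_mean(values, window):
    """Trailing rolling mean; the first samples use a shorter window."""
    smoothed = []
    for i in range(len(values)):
        start = max(0, i - window + 1)
        smoothed.append(mean(values[start:i + 1]))
    return smoothed


# Hypothetical projected-area trace (arbitrary units), one sample per second:
# a flat baseline followed by a drop to half the starting area.
trace = [200.0] * 5 + [180.0, 150.0, 120.0, 100.0, 100.0]

normalized = normalize_area(trace, n_baseline=5)
smoothed = rolling_mean(normalized, window=5)
print([round(v, 2) for v in smoothed])
```

With real data, the same two steps would be applied per colony before averaging across colonies and computing the standard deviation shown as the ribbon.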


Plants and some bacteria synthesize retinal 
and other carotenoids, but the C. flexa tran- 
scriptome lacks a key enzyme in the retinal 
biosynthesis pathway (fig. S3). Therefore, like 
animals, C. flexa must receive retinal or its
precursor, beta-carotene, from its food. It is 
possible that C. flexa acquires retinal or beta- 
carotene from its bacterial prey, with which it 
is cultured. To test whether bacterially pro- 
duced carotenoids are required for light- 
regulated colony inversion, we established 
a culture containing only C. flexa and a co-
isolated bacterium, Pseudomonas oceani, that
lacks genes in the retinal biosynthesis path- 
way (fig. S3) (32). This culture, named “ChoPs” 
(for Choanoeca + Pseudomonas), was expected 
to be devoid of carotenoids, thereby abolishing 
rhodopsin activity. As predicted, C. flexa sheets
in ChoPs cultures did not invert in response to 
darkness (Fig. 3B). Inoculating ChoPs cultures 
with a mixture of co-isolated environmental 
bacteria restored the light-to-dark response, 
demonstrating that a bacterial factor is neces- 
sary for the response. Addition of exogenous 
retinal to ChoPs cultures restored the wild- 
type light-to-dark response (Fig. 3C), indicat- 
ing that the absence of retinal explains the
absence of colony inversion. The requirement 
of retinal for the inversion response and the 
fact that all rhodopsin-encoding genes in the 
C. flexa transcriptome are RhoPDEs suggest that one (or
more) RhoPDE mediates the light-to-dark- 
induced inversion. 

We next investigated whether cyclic nucle- 
otide signaling influences C. flexa inversion. 
Treatment of C. flexa sheets with two inhibitors 
of PDE activity, caffeine (33) and 3-isobutyl-1- 
methylxanthine (IBMX) (34), induced colony 
inversion in the absence of a photic stimulus 
(Fig. 3D). Moreover, incubating C. flexa sheets 
with a cell-permeant analog of cGMP induced 
colony inversion in a dose-responsive manner, 
whereas a cell-permeant analog of cAMP had 
no effect (Fig. 3E), suggesting that cGMP acts 
as a second messenger in phototransduction 
and thereby triggers colony inversion. 

Together, these results indicate that the 
C. flexa response to light-to-dark transitions
relies on a rhodopsin as a photoreceptor and
cGMP as a second messenger. The simplest
interpretation of these findings is that a
RhoPDE controls C. flexa phototransduction.
However, direct validation will require tar-
geted disruption of all four C. flexa RhoPDE
homologs (fig. S2). 


Sheet inversion mediates a trade-off between 
feeding and swimming 


What are the functional and ecological roles of 
sheet inversion in C. flexa? Flagella-in sheets
showed little to no motility, either slowly 
drifting or settling to the bottom of flasks 
(movie S7), whereas inverted (flagella-out) 
sheets swam rapidly (Fig. 4, A and B; fig. S5; 
and movie S4). By contrast, cells in flagella-in 
sheets fed efficiently (>75% cells per sheet 
internalizing beads), whereas cells in flagella- 
out sheets did not (~10% cells per sheet in- 
ternalizing beads on average) (Fig. 4, C to G). 
Moreover, in relaxed sheets, fluid flow con- 
verged toward the center of the colony (carrying 
bacterial prey toward the cells), whereas in 
inverted sheets, the flow was directed away 
from the colony, allowing swimming but not 
feeding (fig. S6). Thus, sheet inversion increases 
motility, which may allow escape from environ- 
mental hazards (including predators), whereas 
sheet relaxation allows enhanced feeding 
efficiency. 

Because sheets swim slowly when relaxed 
and rapidly when inverted, we suspected that 
darkness-induced inversion might allow sheets 
to accumulate in bright areas, effectively 
undergoing phototaxis. Using chambers illumi- 
nated with directional light, we found that 
C. flexa sheets tended to accumulate near the 
illumination source compared with a control in 
which no illumination was provided (Fig. 4H 
and fig. S7). Further, neither sheets from ChoPs
cultures nor single cells from dissociated sheets 
were capable of phototaxis, suggesting that 
rhodopsin activity, multicellularity, and sheet 
inversion are all required for phototaxis (Fig. 
4H). Thus, sheet inversion mediates an ecolog- 
ically relevant trade-off between feeding and 
swimming (Fig. 4I). 


Sheet inversion requires apical 
actomyosin contractility 


How do cells in sheets interact, and what 
mechanisms allow sheet inversion? We found 
that cells in C. flexa sheets form direct contacts 
between their collar microvilli (Fig. 5, A to C, 
and fig. S8) but not through the intercellular 
bridges, shared extracellular matrix, or filopo- 
dial contacts that mediate multicellularity in 
other choanoflagellate species (6, 7, 10, 35). 
Collar morphology differs between relaxed 
and inverted sheets. In relaxed sheets (flagella- 
in; Fig. 5C), the microvilli on each cell as- 
semble into a barrel-shaped collar whose 
diameter varies little from base to tip. In 
inverted sheets (flagella-out; Fig. 5C), the 
microvilli form a flared, cone-shaped collar 
whose diameter increases from base to tip. 
Active “opening out” of the collar, by increasing 
the surface area of the apical side of the sheets 
relative to their basal side, might force a change 
in sheet curvature. Consistent with this, the col- 
lar of dissociated C. flexa cells treated with caf- 
feine opened into a conical shape, straightened, 
and slid down toward the base of the cell, 
whereas untreated controls maintained a 
barrel-shaped collar (Fig. 5, D to G, and fig. S9). 
These data suggest that the changes in collar 
geometry during inversion are actively gen- 
erated by individual cells and do not require 
interactions among neighboring cells. 
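
The collar flaring described here is quantified in Fig. 5, F and G, as a collar angle defined by the tips of two opposite microvilli and the base of the flagellum. The sketch below shows one way to compute such an angle from three 2D image coordinates. It assumes that the vertex of the angle sits at the flagellum base and that the points are measured in a single image plane; the function name and argument order are illustrative.

```python
import math


def collar_angle_deg(tip_a, tip_b, flagellum_base):
    """Angle (degrees) at flagellum_base between the two microvillus tips.

    Each argument is an (x, y) pair in the same image coordinates.
    Assumption: the angle vertex is the base of the flagellum.
    """
    ax, ay = tip_a[0] - flagellum_base[0], tip_a[1] - flagellum_base[1]
    bx, by = tip_b[0] - flagellum_base[0], tip_b[1] - flagellum_base[1]
    dot = ax * bx + ay * by
    cross = ax * by - ay * bx
    # atan2 of |cross| and dot is numerically stable for any angle in [0, 180]
    return math.degrees(math.atan2(abs(cross), dot))
```

Under this definition, a barrel-shaped collar gives a small angle and a flared, conical collar a larger one.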

Sheet bending in animal epithelia is fre- 
quently mediated by apical constriction, in 
which contraction of an apical actomyosin net- 
work reduces the surface area of the apical 
side of the cell (36, 37). Apical constriction is 
mediated by molecular motors belonging to 
the myosin II family, which predates the diver- 
sification of modern eukaryotes (38) and is 
found in all choanoflagellate genomes (39) 
and transcriptomes (9) published to date. 
C. flexa encodes homologs of the myosin II 
regulatory light chain (GenBank accession 
MK787241) and heavy chain (GenBank acces- 
sion MK787240) (fig. S4), whose protein se- 
quences are 78 and 63% similar, respectively, 
to their human counterparts. 

In choanoflagellates and animal epithelial 
cells, the apical pole is defined by the presence 
of a flagellum or cilium, respectively, and/or 
microvilli, and the apicobasal axis of both types 
of cells is broadly considered to be homologous 
(40). C. flexa sheets contain a pronounced F-actin 
ring at the apical pole of each cell, from which 
the microvillar collar extends (Fig. 5H). Ex- 
tending from this ring, we detected a small 
number of longitudinal actin fibers (usually 
two or three) pointing toward the basal pole. 
Diameter measurements showed that the 
F-actin ring was consistently smaller in in- 
verted sheets compared with relaxed sheets 
(Fig. 5, I to L). The same was true for disso- 
ciated, caffeine-treated cells compared with 
the corresponding negative controls (Fig. 5, I 
to J and L), consistent with the ring actively 
constricting during sheet inversion. During 
inversion and in response to caffeine, some 
(but not all) cells transiently acquired a “bottle 
cell” morphology with a narrow apex and a 
bulbous base (fig. S9), reminiscent of animal 
cells undergoing pronounced apical constric- 
tion (41, 42). Caffeine treatment also induced 
shortening of the longitudinal actin fibers (fig. 
S9, G to H), suggesting that fiber contraction 
pulls the collar toward the basal pole. 


Fig. 3. C. flexa cells transduce light stimuli 
through a rhodopsin-cGMP pathway using 
bacterial carotenoids. (A) RhoPDE (blue), 
a choanoflagellate-specific enzyme rhodopsin. 
C. flexa encodes four copies of RhoPDE (fig. S2), 
each comprising an 8-transmembrane-pass 
type I (bacterial) rhodopsin fused to a cyclic 
nucleotide PDE. Photodetection by rhodopsin 
requires binding of the chromophore retinal. 
Under illumination (left panel), the rhodopsin 
domain activates the PDE domain, which 
hydrolyzes cGMP to 5'GMP, thereby reducing 
cellular cGMP levels. Upon light reduction (right 
panel), the PDE domain is inactivated, allowing 
cellular cGMP levels to rise. (B) A bacterial factor 
is required for light-regulated sheet inversion. 
In the presence of diverse “polyxenic” bacteria, 
C. flexa sheets inverted to flagella-out in response 
to decreased illumination (n = 3 colonies). By 
contrast, C. flexa sheets grown only with the 
bacterium P. oceani (ChoPs culture, n = 4 
colonies) did not respond to changes in illumina- 
tion. The photic response of the ChoPs culture 
was restored by inoculating with environmental 
bacteria from the polyxenic culture (ChoPs + env. 
bacteria, n = 3 colonies) but not by inoculating with 
P. oceani bacteria (ChoPs + P. oceani, n = 3 
colonies). (C) Retinal (or one of its carotenoid 
precursors) is the bacterial molecule required for 
the photic response. The photic response of the 
ChoPs culture, which normally does not respond to 
light stimuli (n = 4 colonies), was restored by 
treatment with 125 nM or 500 nM retinal (n = 5 
colonies each). Thus, retinal is required for inversion 
in response to reduced illumination. For (B) and (C), 
photic response was quantified as for Fig. 2. 
(D) PDE activity suppresses sheet inversion in 
C. flexa. Treatment with the PDE inhibitors 
caffeine (10 mM) or IBMX (1 mM) caused C. flexa 
colonies to invert to the flagella-out orientation 
in the absence of a photic stimulus (n = 3 
independent trials; N = 52, 55, and 38 sheets for 
controls; N = 23, 31, and 40 sheets for caffeine; 
N = 42, 37, and 27 sheets for IBMX). (E) cGMP induces sheet inversion. Treating the ChoPs culture with a cell-permeant cGMP analog (8-Br-cGMP) caused sheets to 
invert into the flagella-out orientation in a dose-dependent manner in the absence of a photic stimulus. Treating with 8-Br-cAMP had no effect. 

Using an antibody raised against C. flexa 
myosin II as well as five different commercial 
myosin II antibodies (Fig. 5, M to S, and fig. 
S10), we found that C. flexa cells contain 
myosin that overlaps in regions with the api- 
cal actin ring (Fig. 5, M to P, and fig. S10) and 
longitudinal fibers (Fig. 5, Q to S), consistent 
with the idea that the apical actin network is 
contractile. Inhibition of myosin II activity 
with blebbistatin (43) abolished ring con- 
striction in caffeine-treated dissociated cells 
(fig. S11A) and prevented sheet inversion 
(Fig. 5T), as did inhibition of dynamic actin 
polymerization with latrunculin B (44) and 
inhibition of phosphorylation of the myosin 
regulatory light chain with ML-7 (45) (Fig. 
5T). None of these drugs affected flagellar 
beating, consistent with them specifically tar- 
geting actomyosin. Although compaction is 
normally associated with inversion, drug treat- 
ment resulted in sheet compaction in the ab- 
sence of inversion (fig. S11B), suggesting that 
a baseline level of tension is needed to main- 
tain spacing between cells. Together, these 
results suggest that sheet inversion requires 
apical constriction of an actomyosin network 
at the base of the collar (Fig. 5U). 


The ancestry of apical constriction 


The discovery of sheet bending driven by apical 
constriction in a multicellular choanoflagel- 
late has several potentially important evolu- 
tionary implications. Epithelial sheet bending 
is a fundamental mechanism underlying ani- 
mal embryonic development (36, 46, 47), and 
multicellular contractility also plays a fun- 
damental role in the behavior of adult animals 
by allowing fine-tuned body deformations 
(48). As both embryonic and adult tissue con- 
tractility are found in nearly all animal lin- 
eages, including sponges (49, 50), ctenophores 
(51, 52), placozoans (53), cnidarians (41, 54), and 
bilaterians (36, 37, 55), both were likely present 
in the last common animal ancestor. By contrast, 
collective contractility and apical constriction 
were hitherto unknown in close relatives of 
animals, making their origin mysterious. 
The existence of actomyosin-mediated api- 
cal constriction in C. flexa raises the possibility 
that this cellular module might have been 
present in the last common ancestor of choa- 
noflagellates and animals [which together 
compose the choanozoans (1)]. In addition to 
C. flexa, in which collar contractions occur in 
sheets (Fig. 5G), dissociated cells from sheets 
(Fig. 5, D and E), and naturally solitary “thecate” 
cells (movie S8), collar contractions have also 
been reported in unicellular stages from three 
other choanoflagellates: Codosiga pulcherrima 
(56), Monosiga gracilis (57), and C. perplexa (7). 
We found that four other choanoflagellate spe- 
cies, Monosiga brevicollis, S. rosetta, Salpingoeca 
urceolata, and Diaphanoeca grandis, which 
together cover both main branches of the 
choanoflagellate phylogenetic tree (17), display 
spontaneous changes in collar geometry oc- 
curring at the scale of a few seconds. S. urceolata 
(movie S9) and M. brevicollis (movie S10) 
showed spontaneous and reversible opening 
and closing of the collar (similar to C. flexa), 
whereas S. rosetta (movie S11) and D. grandis 
(movie S12) displayed subtler shape changes 
(reorientation of individual microvilli and 
modulation of collar curvature, respectively) 
(Fig. 6A and fig. S12). Like animal epithelial 
cells and C. flexa, all four species have an apical 
actomyosin ring at the base of the collar (Fig. 
6B). This suggests that the apical actomyosin 
ring is a conserved feature of choanoflagellate 
biology (Fig. 6C) and that unicellular apical 
constriction was present in the last common 
ancestor of choanoflagellates and animals. 


Fig. 4. Sheet inversion mediates a trade-off between swimming and feeding. 
(A and B) Flagella-out sheets swim faster than flagella-in sheets. (A) After light-to- 
dark-induced inversion, swimming speed increased quickly (movie S4), as quantified 
by an increase in the amount of movement (defined as 1 - correlation, where 
“correlation” is the Pearson correlation between two consecutive frames). Movement 
was normalized between 0 and 1 for each of n = 9 movies. Gray shading: error 
bars representing standard deviation. (B) Sheets swim faster after caffeine-induced 
inversion under constant light. Movement quantified as in (A). n = 9 time-lapse 
movies for controls (relaxed sheet populations under constant light) and n = 10 
movies for caffeine-treated samples. (C to G) Flagella-in sheets feed more efficiently 
than flagella-out sheets. Internalization of 0.2-µm fluorescent beads was used to 
quantify phagocytic activity. (C to F) Detection of fluorescent microbeads 
phagocytosed by flagella-in sheets (untreated) and flagella-out sheets (caffeine- 
treated), fixed after 1 hour. C. flexa cells were visualized by DIC (C and E) and beads 
by green fluorescence (C to F). Arrowheads: fluorescent beads [inside the 
cells in (C); stuck to a flagellum in (E)]. (G) Proportion of cells having phagocytosed 
beads in n = 17 flagella-in sheets compared with n = 21 flagella-out sheets. 
P value: χ² test. (H) C. flexa phototaxes in a retinal- and multicellularity-dependent 
manner, as measured by the phototaxis index, which quantifies directional 
accumulation of the sheets toward a localized light source (materials and 
methods). Light-responsive polyxenic (Px) sheets migrated toward a 
lateral light source over 1 hour (n = 12 experiments). By contrast, no directional 
accumulation was observed in Px sheets without directional light (n = 12 
experiments), in dissociated single cells (n = 9 experiments), or in retinal- 
deprived light-insensitive monoxenic cultures (ChoPs; n = 10 experiments) that 
do not undergo inversion. P values: analysis of variance with Dunnett's 
correction. (I) In flagella-in sheets, flagellar beating generates a feeding flow 
that carries bacteria toward the basal side of the cells (fig. S7), allowing 
phagocytosis. In flagella-out sheets, flagellar beating causes rapid swimming, 
whereas the basal side of the cells faces inward, preventing prey capture. 


Brunet et al., Science 366, 326-334 (2019) 18 October 2019 


Fig. 5. Sheet inversion requires apical actomyosin 
cell contraction. (A and B) Cells in a sheet are 
linked by their collars. (A) DIC micrograph showing 
direct collar contacts between neighboring cells. 
f, flagellum; m, microvilli. Dotted line: approximate 
plane of section in (B). (B) Transmission electron 
micrograph of a transverse section through the 
collars of neighboring cells. (C) Collar morphology 
differs between flagella-in sheets (top) and flagella- 
out sheets (bottom). Cells in flagella-in sheets have 
barrel-shaped collars, whereas cells in flagella-out 
sheets have flared, conical collars. Staining: phalloi- 
din. (D to G) Caffeine treatment of dissociated 
cells causes the collar to flare. (D) In the absence 
of caffeine, the collar of a cell from a dissociated 
colony curves upward. (E) In the presence of 10 mM 
caffeine, the collar of a cell from a dissociated colony 
straightened and flared open. Arrowheads: microvilli. 
(F) Differences in collar morphology were quantified 
in control and caffeine-treated single cells by 
measuring the collar angle (defined by the tip of 
two opposite microvilli and the base of the flagellum) 
(fig. S9). (G) Collar angles in caffeine-treated cells 
(n = 28) are wider than in control cells (n = 16). 
P value from Mann-Whitney U test. (H) Actin ring 
connected to longitudinal fibers is present at the base 
of each collar. (Inset) Higher magnification showing 
actin rings and a longitudinal fiber. Staining: 
phalloidin. (I to L) Actin ring constricts during 
inversion in intact sheets and in response to caffeine 
in isolated cells. (I and J) Actin ring observed with 
phalloidin in an untreated cell (I) and in a cell treated 
with 5 mM caffeine (J). (K) Ring diameter is larger 
in flagella-in sheets (n = 8 sheets, N = 124 cells) than 
in flagella-out sheets (n = 7 sheets, N = 110 cells). 
P value from Mann-Whitney U test. (L) Ring diameter 
is smaller in 10 mM caffeine-treated dissociated cells 
(n = 74) and 5 mM caffeine-treated dissociated cells 
(n = 82) than in untreated dissociated cells (n = 89). 
P values are from Dunnett's test for comparing 
several treatments with a control. (M to S) C. flexa 
myosin II localizes to the actin ring and longitudinal 
fibers. (M) Immunostained sheet showing apical rings 
of actin and myosin II at the apical poles of all cells. 
(N to P) Closer views showing the apical ring. (Q to S) 
Side view of a stained C. flexa cell (apical side up) 
showing cytoplasmic foci of myosin II (c.f.), as well 
as cortical staining (cortex) that overlaps with the 
longitudinal fibers. Green: anti-C. flexa myosin II 
antibody, magenta: rhodamine-phalloidin (fig. S10). 
(T) Treatment with inhibitors of actin polymerization 
(latrunculin B, n = 6), myosin contractility (blebbis- 
tatin, n = 6), or myosin activation by phosphorylation 
(ML-7, n = 9) prevented sheet inversion in response to light-to-dark transitions. n = 13 DMSO-treated controls. Photic response is quantified as in Fig. 2. DMSO, 
dimethyl sulfoxide. (U) Proposed role of actomyosin contractility in inversion. 


Fig. 6. Apical constriction is conserved in choanoflagellates. (A) Spontaneous 
collar contractions observed in S. urceolata and M. brevicollis (movies S8 and S9). 
Collar angles measured as in Fig. 5F. th, theca. (B) Apical actomyosin ring 
(arrows) is detected at the base of the collar in four representative choanoflagellate 
species. Myosin II was also detected in microvilli (m) in S. urceolata (arrowhead). 


[Fig. 6 panel labels: side view and top view; F-actin, myosin II, and composite channels for Diaphanoeca grandis, Salpingoeca urceolata, Monosiga brevicollis, and Salpingoeca rosetta.] 

(C) Apical constriction of individual cells was present in the last common 
ancestor of choanozoans and independently gave rise to multicellular apical 
constriction in C. flexa and in animals. *C. perplexa was previously reported (7) to 
undergo transient inversions of colony curvature. On the basis of our study of 
C. flexa, we hypothesize that these inversions reflect collective apical constriction. 


What is the function of apical constriction in 
single cells? In some sessile choanoflagellates, 
including in the thecate form of C. perplexa, 
collar contraction in response to mechanical 
stimulation allows retraction of the cell inside 
an extracellular structure called a theca (19), 
suggesting it represents a defensive with- 
drawal reflex from predators or other threats. 
In free-swimming cells, collar contraction 
might fine-tune the hydrodynamics of swim- 
ming or feeding. For example, a closed collar 
might reduce drag and facilitate locomotion, 
whereas a flared collar could slow down swim- 
ming and increase collar area, thereby facilitat- 
ing prey capture. Validation of these functional 
hypotheses will require direct testing. 

In contrast to single-cell apical constriction, 
the multicellular sheet bending observed in 
C. flexa and C. perplexa (7) has not been 
reported in other choanoflagellates. This sug- 
gests that apical constriction was present in 
solitary cells in the last choanozoan common 
ancestor and was independently converted 
into multicellular sheet bending through the 
evolution of intercellular junctions in animals 
(58) and the evolution of microvillar adhe- 
sions in C. flexa. Multicellular inversion has 
been proposed to have been part of the de- 
velopmental repertoire of ancient animals 
(1, 59), on the basis of the existence of whole- 
embryo inversion (from flagella-in to flagella- 
out) during calcareous sponge development 
(60). A similar inversion (but much slower, 
about an hour long, and irreversible) takes 
place during the development of the alga 
Volvox (61). Given the large evolutionary 
distance between choanoflagellates and vol- 
vocalean green algae, along with the absence 
of inversion in intervening branches, inver- 
sion likely evolved independently in both 
groups (1). 

These observations suggest that apical 
actomyosin-mediated cell constriction evolved 
on the choanozoan stem lineage (Fig. 6B). 
Could polarized actomyosin contractility be 
even more ancient? Polarized actomyosin con- 
tractions have been implicated in multicellular 
morphogenesis in the fruiting body of the 
slime mold Dictyostelium (62) and may be 
homologous to those observed in choano- 
flagellates and animals. However, the absence 
of comparable processes in the intermediate 
branches between Dictyostelium and choano- 
zoans raises the possibility that polarized cell 
contractions in Dictyostelium and apical con- 
striction in choanozoans evolved independently 
(63). The ichthyosporean Sphaeroforma arctica, 
a close relative of choanozoans, forms large 
multinucleated spores that partition into 
distinct cells in an actomyosin-dependent 
process (64), providing an independent ex- 
ample of actomyosin-dependent multicellular 
development. 

In animals, the control of multicellular con- 
tractions invariably relies either on the co- 
operation of multiple cell types [as in adult 
organisms (54, 55, 65)] or on complex pro- 
grammed signaling cascades [as in embryos 
(36, 46, 47, 66, 67)]. By contrast, C. flexa di- 
rectly converts sensory stimuli into collective 
contractions, without observable spatial cell 
differentiation, and evokes some hypotheses 
of early animal evolution that envisioned the 
first contractile tissues as homogeneous myo- 
epithelia of multifunctional sensory-contractile 
cells (68). 

The fact that contractility in C. flexa can be 
controlled by light represents another intriguing 
parallel to animal biology. Indeed, rhodopsin- 
cGMP pathways similar to that in C. flexa also 
underlie phototransduction in some animal 
cells [for example, bilaterian ciliary photo- 
receptors (31, 69) and cnidarian photorecep- 
tors (70, 71)], as well as in fungal zoospores 
(72). In contrast with choanoflagellates, how- 
ever, phototransduction in animal photorecep- 
tors relies on a type II (eukaryotic) rhodopsin 
that activates a separate PDE through a 
G-protein intermediary (31, 69) (fig. S13). Mean- 
while, fungal zoospores use a distinct rhodop- 
sin fusion protein (a type I rhodopsin fused to 
a guanylyl cyclase) to increase cellular cGMP 
in response to light (72) (fig. S13). If a RhoPDE 
fusion protein controls C. flexa phototrans- 
duction, this would represent a third indepen- 
dent solution to the problem of transducing 
information from a change in illumination into 
a change in cyclic nucleotide signaling. 

Much remains to be discovered concerning 
the ecological function, mechanical underpin- 
nings, and molecular mechanisms of photo- 
transduction and apical constriction in C. flexa. 
A deeper understanding will require the devel- 
opment of molecular genetic tools, which have 
only recently been established in S. rosetta 
(35, 73) and D. grandis (74). Nonetheless, 
C. flexa demonstrates how the exploration of 
choanoflagellate diversity can reveal biologi- 
cal phenomena and provides an experiment- 
ally tractable model for studying multicellular 
sensory-contractile coupling. 
WATER OXIDATION 


An oxyl/oxo mechanism for oxygen-oxygen coupling 
in PSII revealed by an x-ray free-electron laser 


Michihiro Suga, Fusamichi Akita, Keitaro Yamashita, Yoshiki Nakajima, Go Ueno, 
Hongjie Li, Takahiro Yamane, Kunio Hirata, Yasufumi Umena, Shinichiro Yonekura, 
Long-Jiang Yu, Hironori Murakami, Takashi Nomura, Tetsunari Kimura, Minoru Kubo, 
Seiki Baba, Takashi Kumasaka, Kensuke Tono, Makina Yabashi, Hiroshi Isobe, 
Kizashi Yamaguchi, Masaki Yamamoto, Hideo Ago, Jian-Ren Shen 


Photosynthetic water oxidation is catalyzed by the Mn4CaO5 cluster of photosystem II (PSII) 
with linear progression through five S-state intermediates (S0 to S4). To reveal the 
mechanism of water oxidation, we analyzed structures of PSII in the S1, S2, and S3 states 
by x-ray free-electron laser serial crystallography. No insertion of water was found in S2, but 
flipping of D1 Glu189 upon transition to S3 leads to the opening of a water channel and provides 
a space for incorporation of an additional oxygen ligand, resulting in an open cubane Mn4CaO6 
cluster with an oxyl/oxo bridge. Structural changes of PSII between the different S states reveal 
cooperative action of substrate water access, proton release, and dioxygen formation in 
photosynthetic water oxidation. 


Oxygenic photosynthesis by plants, algae, 
and cyanobacteria converts light energy 
from the sun into chemical energy in the 
form of sugar and concurrently releases 
dioxygen into the atmosphere, thereby 
sustaining all aerobic life on Earth. The first 
reaction in oxygenic photosynthesis occurs in 
PSII, which harbors the oxygen-evolving com- 
plex (OEC) that catalyzes stepwise oxidation of 
water through the S-state cycle (S_i), which in- 
cludes the ground state S0 and four oxidized 
intermediates, S1 to S4 (1, 2). Dioxygen is gen- 
erated in the final step of the S-state transition 
S3→(S4)→S0 (3). The OEC contains a Mn4CaO5 
cluster organized in a distorted-chair form, 
in which an external Mn is attached to a 
Mn3CaO4 cubane by two μ-oxo O4 and O5 
atoms (4, 5). In the OEC, a μ-oxo O5 has 
unusually long distances from the nearby Mn 
ions, suggesting weak binding and higher re- 
activity. O5 may thus serve as one of the 
substrates for O=O bond formation (1, 2, 4-10). 
Time-resolved, pump-probe x-ray free electron 
laser crystallographic analysis of PSII in the S3 
state at 2.35-Å resolution showed an incorpo- 
ration of an oxygen O6 into the OEC in the 
vicinity of O5, supporting a dioxygen forma- 
tion mechanism between O5 and O6 (11). This 
reaction can proceed via either (i) an oxyl/oxo 
radical coupling (12), (ii) a nucleophilic attack 
reaction mechanism (13), or (iii) a peroxide 
intermediate mechanism (14). Owing to the 
limited resolution of the S3-state structure, 
however, the chemical entity of the oxo inter- 
mediates (superoxo, peroxo, oxyl/oxo, and 
hydroxo/oxo) has not been identified, therefore 
the reaction mechanism was not determined 
unambiguously. 
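
The S-state cycle described above can be pictured as a four-position counter that returns to S0 each time dioxygen is released. The sketch below is illustrative only. It assumes a dark-adapted sample that starts in S1 and an idealized advance of exactly one state per flash with no misses, and it treats S4 as a transient that is never tabulated. The function names are hypothetical.

```python
def state_after_flashes(n_flashes, start=1):
    """Index i of the stable S_i state after n_flashes.

    Assumptions (idealized): each flash advances the cluster by one
    state, and S4 decays immediately to S0.
    """
    return (start + n_flashes) % 4


def o2_released(n_flashes, start=1):
    """Number of S3 -> (S4) -> S0 transitions, each releasing one O2."""
    return (start + n_flashes) // 4
```

Under these assumptions, one flash from the dark state gives S2, two flashes give S3, and the third flash completes the cycle with release of one dioxygen.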



The OEC is located at a node of five water 
channels (4) (Fig. 1) involved in proton release, 
balancing the net charge of the OEC, and inlet 
of substrate water (1, 15). Dislocation of water 
W665 in the O4 channel upon progress of the 
S_i state results in closure of the 15-Å-long, 
water-mediated hydrogen bonding network 
(11, 16). We previously attributed this struc- 
tural change to a proton release through the 
O4 channel, whereas others have argued that 
W665 serves as the source of the O6 atom 
involved in O=O bond formation (17, 18). Lack 
of a high-resolution structure of the S2 state 
has contributed to uncertainty about the struc- 
tural changes leading to or from this state. To 
address these issues, we fixed PSII in the S1, S2, 
and S3 states, and the triply flashed (3F) state 
by a cryo-trapping method with microcrystals 
of PSII from the thermophilic cyanobacterium 
Thermosynechococcus vulcanus. Structures at 
2.15- to 2.50-Å resolution using fixed-target 
serial femtosecond crystallography (SFX) 
are consistent with the structures at room 
temperature determined previously (11, 16) 
but allow more accurate determination of 
interatomic distances in the OEC, reveal- 
ing the chemical entity of the intermediate 
O5 and O6 species and the corresponding 
changes in the protein-ligand environment 


Fig. 1. Water 
networks in PSII. 
Five hydrogen-bonded 
water networks 
surround the OEC 
(PDB ID 4UB6) (5). 
The upper-bound 
distance for the 
hydrogen bonds 
shown is 3.3 Å. CL1 
and CL2 represent 
chloride ions found 
near the OEC. 


Fig. 2. Structural changes of the OEC during the S_i-state transitions. 
(A to C) OEC structures superimposed with Fobs − Fobs isomorphous difference 
Fourier maps of (A) 1F minus dark, (B) 2F minus 1F, and (C) 2F minus dark 
datasets. Structures before and after S_i-state transition are shown in gray and 
color, respectively. Difference maps are contoured at −3σ (red) and 3σ (cyan). 
Structural changes consistent with isomorphous difference Fourier maps are 
represented by black arrows, where larger arrowheads represent the larger 
structural changes. (D to F) Interatomic distances (angstroms) of the OECs 
in the (D) S1, (E) S2, and (F) S3 states. Blue and red lines indicate elongation 
and shortening of the interatomic distances compared with the structure in the 
preceding S_i state. Presumed Mn (+4) and Mn (+3) cations are shown. 
(G) The Fobs − Fcalc difference Fourier maps contoured at −2.2σ (red) and 
+2.2σ (cyan) after structural refinement by fixing the distance between O5 and 
O6 at 1.3 Å (superoxide), 1.5 Å (peroxo), 1.7 Å, 1.9 Å (oxyl/oxo), and 2.4 Å 
(hydroxo/oxo), respectively. Values in parentheses are temperature factors 
(square angstroms) of O5 and O6. Black arrows indicate the residual electron 
densities that were affected by the interatomic distances between O5 and O6. 
(H) The Fobs − Fcalc difference Fourier maps contoured at 5σ (blue) when 
both O5 and O6, or just O6, were omitted. Colors used here are the same in all 
figures unless otherwise noted. 


responsible for the proton exit and water 
inlet during the S-state cycle. 
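
The Fig. 2G legend pairs each trial O5-O6 distance with a candidate chemical species. As a reading aid, the sketch below stores those labeled pairings in a small table and returns the candidate whose reference distance lies closest to a given value. The nearest-match lookup is an assumption made for illustration, not the refinement procedure used for the figure, which compared residual electron density at each fixed distance. The unlabeled 1.7 Å trial is omitted, and the names are hypothetical.

```python
# Reference O5-O6 distances (angstroms) as labeled in the Fig. 2G legend.
CANDIDATE_DISTANCES = {
    "superoxide": 1.3,
    "peroxo": 1.5,
    "oxyl/oxo": 1.9,
    "hydroxo/oxo": 2.4,
}


def nearest_candidate(distance_angstrom):
    """Return the labeled species whose reference distance is closest.

    Illustrative lookup only; it ignores coordinate error and does not
    reproduce the density-based comparison described in the legend.
    """
    return min(
        CANDIDATE_DISTANCES,
        key=lambda name: abs(CANDIDATE_DISTANCES[name] - distance_angstrom),
    )
```

For example, `nearest_candidate(1.9)` returns `"oxyl/oxo"`.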


Structural determination 


Single-shot diffraction images were collected 
in a fixed-target data collection manner at a 
cryogenic temperature (5, 19), in which PSII 
microcrystals were used (fig. S1, see supple- 
mentary materials and methods). Compared 
with SFX of PSII using a grease matrix as the 
injection medium at room temperature (11), 
this method reduced the sample consump- 
tion by one order of magnitude and ensured 
low background images, allowing us to collect 
diffraction images with high hit and index 
rates, which yielded 2.15-Å resolution datasets 
for PSII in the S1, S2, and S3 states (dataset 1 in 
tables S1 and S2). We collected datasets of a 3F 
state and S1, S2, and S3 states (dataset 2) at 
2.35- to 2.50-Å resolution independently to con- 
firm the reproducibility of the light-induced 
structural changes and to examine the struc- 
tural changes beyond the S3 state. 

Strong peaks in the isomorphous difference 
Fourier maps calculated between each state 
(Fig. 2, A to C, and fig. S3) indicate successful 
detection of the structural changes induced 
by the flash illuminations. Most peaks were 
localized in the vicinity of OEC, in agreement 
with the previous observations at room tem- 
perature by the SFX method (11) (Figs. 2 to 4 
and fig. S3). However, structural changes in 
the quinone QB-binding site were diminished 
substantially, reflecting the relaxed protein envi- 
ronment due to the longer delay time after 
flash illumination in the present study (1 s in 
the fixed-target method versus 10 ms in the 
SFX method). Pairs of positive and negative 
density peaks were more clearly visible in the 
current difference maps around the OEC owing 
to the high resolution achieved as well as the 
high isomorphism between the different S-state 
datasets. Changes in the interatomic distances 
between the four Mn atoms accompanying the 
S_i-state transition (table S4) reflect changes of 
the oxidation states of the Mn ions. Difference 
maps between sequential states allowed us to 
define the order of serial structural changes 
induced by S-state progression. 


Fig. 3. Structural changes in the O4 channel. (A) Fobs − Fobs isomorphous difference Fourier map 
of 1F minus dark contoured at 3σ superimposed with the hydrogen-bonding network in the O4 
channel. Dislodged water molecules are highlighted by blue circles. (B) Fobs − Fcalc when water or 
O6 were omitted (top) or Fobs − Fobs (bottom) difference density values at water sites in the O4 
channel. rmsd, root mean square deviation. (C) Possible proton transfer mechanism in the O4 
channel. Hydrogen-bonding networks before (left) and after (right) proton transfer. The putative 
proton stored at the water cluster is depicted in green and indicated by a red arrow. 


The S1 and S2 states 


The Mn-Mn distances in the S1 state in our 
structure (Fig. 2D and table S4) are similar 
to previous results (5, 11, 16, 20), with the ex- 
ception being that the shortest distance (Mn1- 
Mn2, 2.60 Å) is slightly shorter than what was 
previously reported. The S2 state may adopt 
either an open or a closed cubane form accord- 
ing to theoretical studies (27, 22). The open 
cubane corresponds to the structure giving 
rise to the S=1 /2, & = 2.0 electron para- 
magnetic resonance (EPR) multiline signal 
(S, spin; g, g-factor, a dimensionless quan- 
tity that characterizes the magnetic moment 
and angular momentum of an atom), and the 
closed cubane to the S = 5 /o, g = 4.1 signal 
(21, 22). The S. state in our structure is in the 
open cubane form, in agreement with the 
absence of the g = 4.1 EPR signal in cyano- 
bacterial PSII under normal conditions (23). 
Difference density analysis reveals that, upon 
§,-to-S, transition, Mn4 is shifted toward 
Glu”? and Ca is moved toward Yz (Tyr’®’) 
slightly (Fig. 2A). There is a shift of O5 toward 
Mn4 and a weakening or breakage of the oxy 
bridge between O5 and Mn1 in the Sy state 
(fig. S4). These changes are consistent with 
Mn4: oxidation during the S,-to-S, transition, 
giving rise to a bipyramidal five coordinated 
Mni and an open cubane OEC structure con- 
sistent with a charge distribution of (Mn1, 
Mn2, Mn3, Mn4) = (III, IV, IV, IV). Slight 
changes in Mn-Mn distances are observed 
in the S, state (Fig. 2E), resulting in an in- 
creased homogeneity in these distances con- 
sistent with x-ray absorption fine structure 
measurements (24). 

In addition to the structural changes of
the OEC, strong negative densities were found
for two water molecules, W665 and W571, that
are hydrogen-bonded with ligand residues of
the OEC (Fig. 2, A and C). These changes have
been found in the S1-to-S3 transition (11), but
the present results show that these two water
molecules already become highly disordered
in the S1-to-S2 transition; therefore, the possibility
of W665 as the substrate for O=O bond
formation (17, 18) is unlikely. W665 is the second
water molecule from O4 within the 15-Å-long
O4 water chain that ends at a water
cluster consisting of five water molecules exposed
to the luminal solution (Fig. 1). This
water cluster may accept a proton released
from the OEC as a protonated Eigen cation through
Grotthuss-type proton transfer (25), consistent
with Fourier transform infrared spectroscopy,
which showed changes in a highly
polarized hydrogen-bond network (26-28) or
possible formation of a nH2O(H3O)+ cluster
in the S2 state (29, 30). Despite the long distance
between the water cluster and the OEC,
changes were observed in both the electron density
and the shape of the water cluster (Fig.
3, A and B). These changes may result from
positive charge accumulated during the S1-to-S2
transition, possibly by the ejection of a
proton from the OEC through W665. The
increased mobility of W665 may be required
to prevent the backflow of the proton by disconnecting
the water chain (Fig. 3C) and is
accompanied by a slight shift of the side chain
of CP43 Glu®* hydrogen-bonded with W665. 
W571 also increases its mobility on the S1-to-S2
transition (Figs. 2, A and C, and 3B), likely as a
result of the accumulated positive charges on
the OEC. W665 appeared in its position again
in the transition from the S3 to the 3F state (supplementary
text, fig. S3K), indicating regeneration
of the long O4 hydrogen-bond network after
the S3 state for proton transfer in the next
reaction cycle.


The oxyl/oxo species in the S3 state 


Larger difference densities were observed
around the OEC for the S2-to-S3 transition
than for the S1-to-S2 transition (Fig. 2, A and
B, and fig. S3, B and C). Mn1, Mn2, and Mn3
are static, but Mn4 is moved toward Ser169
(Fig. 2B), resulting in an increase in the Mn1-Mn4
distance (Fig. 2, E and F). Flipping of the
side chain of Glu189, also seen at room temperature
(11), provides an open space in the
vicinity of O5, enabling the insertion of O6
(Fig. 4, A and B). The improved resolution of
the dataset allowed us to identify the position
of O6 clearly in both Fobs - Fobs and Fobs - Fcalc
difference maps and thus determine its chemical
structure unambiguously (Fig. 2, B, C, and H).
By altering the O5-O6 distance and examining
the residual densities in the Fobs - Fcalc difference
Fourier map, we found that a distance of 1.9 Å
resulted in the weakest residual densities (Fig.
2G) (see materials and methods). This distance
is slightly longer than the 1.5 Å we reported
previously (11) and consistent with an oxyl/oxo
pair for the O5 and O6 species. Distances for
superoxo (1.3 Å), peroxo (1.5 Å), and hydroxo/oxo
(2.4 Å) species can be excluded on the
basis of their increased residual densities. Furthermore,
a hydroxo/oxo pair cannot be accommodated
by the current structure owing
to the limited space available, unless the interatomic
distance between O5 and Mn4 is
shorter by 0.3 Å. A peroxo species fits
the electron density similarly to the
oxyl/oxo species, explaining why we could
not discriminate between them in the previous study (11);
however, the oxyl/oxo species resulted in weaker
residual electron densities in the Fobs - Fcalc
map and evenly distributed temperature factors
for O5 and O6. The interatomic distances
of the OEC in the S3 state agree well with those
of the theoretically optimized structure of S3
when the oxyl/oxo species was assumed in
the open cubane form (table S5) (31). Thus,
we conclude that the O5 and O6 pair is in
an oxyl/oxo form, which would be consistent
with an oxyl/oxo coupling mechanism for
O=O bond formation (31).
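
The distance scan described above amounts to a one-dimensional search: refine the model at several trial O5-O6 distances and keep the one with the weakest residual density. A minimal Python sketch of that selection step follows. The function names and the `toy_residual` curve are hypothetical placeholders introduced here for illustration. The toy curve is synthetic, is built to have its minimum at 1.9 Å, and does not reproduce any crystallographic data; in practice the residual would be read from the Fobs - Fcalc map after refinement at each trial distance.

```python
from typing import Callable, Iterable


def weakest_residual_distance(candidates: Iterable[float],
                              residual: Callable[[float], float]) -> float:
    """Return the trial distance whose residual density is smallest in magnitude."""
    return min(candidates, key=lambda d: abs(residual(d)))


def toy_residual(distance: float) -> float:
    """Synthetic stand-in for a map-derived residual (NOT real data)."""
    return (distance - 1.9) ** 2


# Trial O5-O6 distances (in Å) named in the text:
# superoxo, peroxo, oxyl/oxo, hydroxo/oxo.
trial_distances = [1.3, 1.5, 1.9, 2.4]
best = weakest_residual_distance(trial_distances, toy_residual)
```

Taking the absolute value treats positive and negative residual peaks alike, which is one possible reading of "weakest residual densities"; other scoring choices are equally plausible.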

Movement of D1 Glu189, the only monodentate
carboxylate ligand of the Mn4CaO5
cluster, by 0.5 Å during the S2-to-S3 transition
causes an elongation of the distance between
its OE1 and Ca2+ from 3.1 Å in the S1 state to
3.5 Å in the S3 state (Fig. 4C). The Ca2+ ion can
be considered to have a coordination number
of 7.5 or to be pseudo-eight-coordinated in the S1
state, because it has seven “normal” ligands
plus D1 Glu189. In the S3 state, the interaction
between D1 Glu189 and Ca2+ is much weaker,
but the newly inserted O6 provides an eighth
ligand to Ca2+ with a distance of 2.6 Å (Fig. 4C),
which is made possible by the ability of Ca2+
to adopt coordination numbers from four to
eight (32). The two hydrogen atoms brought
by O6 are, respectively, ejected from the active
site during the S2-to-S3 transition and accepted
by D1 Glu189, which forms a hydrogen bond
with O6 (Fig. 4C), giving rise to the oxyl/oxo
species between O6 and O5.

Fig. 4. Structural changes in the water-inlet O1 channel and around the Ca2+ ion. (A) Isomorphous
difference map (3σ) superimposed with the O1 channel. O6 and two water molecules that appear in the
S3 state are marked with a red dashed circle. (B) Model of structural changes in the O1 channel. (C) The
ligand environment of Ca2+ in the S1 and S3 states. Weak interactions between Glu189 and Ca2+, and between
Glu189 and O6, are shown as dashed lines. Possible hydrogen atom transferred from O6 to Glu189 is
shown as a solid green line. (D) Schematic structures of OEC and the states of the O1 and O4 channels
in the S1-S3 states. The water molecule from the O1 channel may be incorporated into O6 in S3 or refill the
water molecules waiting for the next catalytic reaction.


The mechanism for substrate water delivery 


Flipping of the Glu189 side chain is correlated
with motions in a short loop of CP43, including
CP43 Val410, that restrict the size of the O1
channel (Fig. 4, A and B). Paired positive and
negative densities were found around CP43
Val410, indicating a movement of this residue
toward Glu189 by 0.5 Å in the S3 state. This
movement substantially widens the channel
radius (Fig. 4, A and B), which may allow
water molecules to come in during the S-state
transition. Strong negative difference densities
were found in positions overlapping with
W503, W547, and W554 as well as glycerol-526
(Gol526) present in the channel, reflecting
increased mobility of these molecules in
response to the channel opening in the S3
state. Two positive difference densities in
the hydrogen-bonded water network indicate
the insertion of water molecules W666 and
W667 into the O1 channel (Fig. 4, A and B).
The O1 channel likely serves as a conduit for
substrate water entry into the OEC, and the
residues around CP43 Val410 may serve as a
“valve” to control the water-inlet channel. Movements
in this region coincide with incorporation
of O6 into the open cubane structure of the OEC
during the S2-to-S3 transition. Thus, Glu189
plays a pivotal role in coupling oxidation of
the OEC with the opening of the water channel
and delivery of the substrate into the OEC.


Outlook 


The structural changes related to proton release,
water inlet, and O=O bond formation
during the S-state cycle are summarized in
Fig. 4D. The displacement of W665 in the O4
channel suggests a proton transfer from the
OEC upon the S1-to-S2 transition. Considering that
theoretical calculations favor O5 as a hydroxide
ion in S1 and an oxide ion in the open cubane
S2 structure (33), and that there is no proton
release to the bulk solution in the S1-to-S2
transition, a proton is likely ejected from the
O5 site and stored as a protonated Eigen cation
in PSII. The structural changes observed in
the O4 channel thus provide insights into the
timing of proton release and an elegant way
to prevent the backflow of the released proton.
Insertion of O6 occurs in the S2-to-S3 transition,
providing an oxyl/oxo species in the S3
state. Structural changes of the Glu189 side
chain serve to couple oxidation of the catalytic
site with substrate water access, proton
release, and O=O bond formation via the oxyl/oxo
coupling mechanism. The structure and
O=O bond formation mechanism revealed
here should serve as an important blueprint
for rational design of artificial catalysts that
are capable of oxidizing water using visible
light.

Suga et al., Science 366, 334-338 (2019)

Human activities are fundamentally altering biodiversity. Projections of declines at the global scale
are contrasted by highly variable trends at local scales, suggesting that biodiversity change may be
spatially structured. Here, we examined spatial variation in species richness and composition change
using more than 50,000 biodiversity time series from 239 studies and found clear geographic variation
in biodiversity change. Rapid compositional change is prevalent, with marine biomes exceeding and
terrestrial biomes trailing the overall trend. Assemblage richness is not changing on average, although
locations exhibiting increasing and decreasing trends of up to about 20% per year were found in
some marine studies. At local scales, widespread compositional reorganization is most often decoupled
from richness change, and biodiversity change is strongest and most variable in the oceans.


Humans are reshaping biodiversity patterns.
Against a background of elevated
extinction rates (1, 2), local biodiversity
change results from multiple interacting
drivers that influence the abundance
and distribution of species. Different regions
of the globe are projected to experience different
trends in biodiversity change, particularly
those caused by variations in the strength
of drivers such as land-use intensity (3) and
climate change (4). There are widespread
changes in the identities of species that live
in any one location (species composition),
whereas shifts in the numbers of species
(species richness) show mixed patterns, with
increasing, decreasing, or static trends (5-9).
However, the spatial distribution of the locations
most affected is unknown. Here, we
map biodiversity change, in terms of species
richness and composition, to uncover the geography
of biodiversity change. Our analysis
compares assemblage time series across the
marine, terrestrial, and freshwater realms,
different biomes, and latitudinal bands (i.e.,
polar, temperate, and tropical).

Both biodiversity and its change are unevenly
distributed on the planet (10, 11) and
unevenly sampled (12-15). Species densities
typically decline drastically from the tropics
to the poles, and the identities of species differ
across continents and oceans. Hence, knowing
which locations are undergoing different types
of net change in biodiversity is critical to understanding
how biodiversity is changing globally.
Detecting geographic variation in biodiversity
trends will not only improve our understanding
of how global biodiversity is changing but
will also inform conservation prioritization.
Specifically, by identifying the regions of the
planet that are changing more, we will be
better placed to make informed decisions
about the spatial distribution of biodiversity
vulnerability and about where to prioritize
reactive (such as restoration) and proactive
(protection) conservation actions (16). In addition,
quantifying this spatial distribution will
refine hypotheses about the drivers of biodiversity
change.

Spatial patterns in biodiversity change are
the combined result of species changing their
distributions, entering and leaving local communities,
going extinct, or speciating. These
processes are affected by many drivers, which
themselves are spatially heterogeneous (17, 18)
and differ between the marine and terrestrial
realms (19). For example, spatial overlap between
climate change and other drivers of change is
greater in the marine realm than in the terrestrial
realm (19). Moreover, species sensitivities
to climate change (i.e., temperature
increases) are also greater in the marine realm
(20, 21). When coupled with ecological differences
between realms, such as fewer barriers
to dispersal and greater colonization rates in
marine ecosystems (22, 23), these differences
may result in greater compositional change
in marine compared with terrestrial assemblages
(21). Therefore, we predict that biodiversity
in the marine realm has changed more than
it has in the terrestrial realm. Changes in community
composition are not necessarily associated
with changes in species richness if species
gains and losses are approximately balanced
(5-9, 24, 25). However, in regions where land-use
intensity is high (26) or where range sizes
contract in response to climate change (27),
species extirpations could result in a decrease
in species richness. Conversely, in regions subject
to high rates of species introductions (28-31)
or high connectivity, or where ranges expand
(32, 33) or species are broadly favored by land-use
change (34), species richness could increase.
Hence, we expect there to be variation in biodiversity
change across different biomes and
geographical regions of the planet. There may
also be latitudinal differences in biodiversity
change. For example, tropical regions are entering
climatic conditions with no present-day
equivalents (35), and intensification of land-use
change is more recent than in temperate
regions (36); therefore, richness loss may be
more prevalent and more extreme in tropical
latitudes. Here, we investigated whether biodiversity
change differs in magnitude between
the realms and whether strong geographic patterns
exist in the changes to species richness
and composition across realms, latitudes, or
regions.

We examined geographic variation in patterns
of change in both species richness and
composition using local assemblage time series
from across the globe (37, 38) (fig. S1).
The BioTIME database is currently the largest
compilation of assemblage time series, and
our analysis included 239 independent studies
(table S1). Because spatial extent varied considerably
among studies, we used a gridding
method (96-km² hexagonal cells) (39)
to partition the 126 studies that had multiple
sampling locations and large spatial
extents (38); 113 studies were not partitioned
because they were contained within a single
grid cell. This resulted in 51,932 individual
local assemblage time series, with each time
series composed of samples from only one
study. This means that important study-level
considerations (e.g., sampling method) were
consistent within each time series. After further
filtering by sampling completeness and
standardization by sample-based rarefaction
(38), these time series became the lowest level
in our hierarchical models of temporal trends.
Temporal extent and start date vary substantially
within these data: time series span from
the late 1800s to the present, though most
data come from the past 40 years (fig. S2), and
we examined the sensitivity of our results to
this heterogeneity.

¹German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Germany. ²Martin Luther University Halle-Wittenberg, Institute of Computer Science, Halle (Saale), Germany. ³Data Analytics Program, Denison University, Granville, OH, USA. ⁴Centre for Biological Diversity, School of Biology, University of St. Andrews, St. Andrews, UK. ⁵Department of Biology and CESAM, Universidade de Aveiro, Aveiro, Portugal. ⁶Research Centre for Ecological Change, Organismal and Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland. ⁷Department of Ocean Sciences, Memorial University of Newfoundland, Newfoundland, Canada. ⁸Martin Luther University Halle-Wittenberg, Institute of Biology/Geobotany and Botanical Garden, Halle (Saale), Germany. ⁹School of Biology and Ecology, University of Maine, Orono, ME, USA. ¹⁰School of GeoSciences, University of Edinburgh, Edinburgh, UK. ¹¹Senckenberg Gesellschaft für Naturforschung, Biodiversity and Climate Research Centre (BiK-F), Frankfurt am Main, Germany. ¹²Department of Biology, University of Massachusetts Boston, Boston, MA, USA. ¹³Department of Biology, Quebec Centre for Biodiversity Science, McGill University, Montreal, QC, Canada. ¹⁴Leipzig University, Institute of Biology, Leipzig, Germany. ¹⁵Department of Ecology, Evolution, and Behavior, University of Minnesota, St. Paul, MN, USA. ¹⁶Department of Biological Sciences and Institute for the Study of the Environment, Sustainability, and Energy, Northern Illinois University, DeKalb, IL, USA. ¹⁷Department of Zoology, University of British Columbia, Vancouver, BC, Canada. ¹⁸Département de biologie, Université de Sherbrooke, Sherbrooke, QC, Canada. ¹⁹Ocean and Earth Science, National Oceanography Centre, University of Southampton, Southampton, UK. ²⁰Life Sciences, Natural History Museum, Cromwell Road, London, UK.
*Corresponding author. Email: sablowes@gmail.com (S.A.B.); supps@denison.edu (S.R.S.); maadd@st-andrews.ac.uk (M.D.) †These authors contributed equally to this work.

Blowes et al., Science 366, 339-345 (2019)    18 October 2019
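
Sample-based rarefaction is named above but detailed only in the supplement (38). As a rough illustration of the general idea, assuming each year is stored as a list of samples and each sample as a set of species names, one can repeatedly subsample every year down to the smallest number of samples found in any year and average the resulting richness. The function name, data layout, and number of draws are assumptions made for this sketch, not the authors' implementation.

```python
import random


def rarefied_richness(samples_by_year, n_draws=200, seed=0):
    """Mean species richness per year after subsampling to equal effort.

    samples_by_year maps a year to a list of samples; each sample is a
    set of species names (hypothetical layout chosen for this sketch).
    """
    rng = random.Random(seed)
    # Standardize effort to the least-sampled year.
    min_n = min(len(samples) for samples in samples_by_year.values())
    result = {}
    for year, samples in samples_by_year.items():
        total = 0
        for _ in range(n_draws):
            chosen = rng.sample(samples, min_n)
            # Richness is the number of distinct species in the subsample.
            total += len(set().union(*chosen))
        result[year] = total / n_draws
    return result


example = {
    2000: [{"a", "b"}, {"c"}],
    2001: [{"a"}, {"a"}, {"a", "b"}, {"b"}],
}
richness = rarefied_richness(example)
```

Years that already have the minimum number of samples keep their observed richness, while better-sampled years are averaged over many equal-effort subsamples, so richness differences between years are less likely to reflect sampling effort alone.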

Fig. 1. Species richness change maps showing departures from the overall
trend for marine and terrestrial biomes. Inset shows the overall trend in
assemblage species richness change; bar depicts 50% (thick) and 90% (thin)
credible intervals. Shading on the map represents positive (blue; faster
increases in species richness than average) and negative (red; slower increases
in species richness than average) departures for each biome from the overall
average species richness change (0.004 log species per year). Numbers in the
inset denote the departure and the biome-level (overall + departure)
estimate in parentheses. The 90% credible intervals for all biome-level estimates
overlap zero. (A) Marine biomes (n = 33) show both positive and negative
departures from the overall trend, with more negative departures in the
tropics, whereas there are no latitudinal trends in (B) terrestrial (n = 10) and
freshwater (n = 5) biomes, which also show both positive and negative
departures from the overall trend.

Biodiversity trends across the globe

To examine geographic patterns in biodiversity
change, we quantified realm, latitudinal, and
regional departures from the overall trends of
richness and composition change using hierarchical
generalized linear models. We first
nested the cell-level time series within the 239
original studies to control for effects of sampling
methods and nonindependence of cell-level time
series that came from a single study. Throughout
our analysis, we controlled for not having
the same taxa sampled everywhere by including
taxon in our models. Our first model, the
biome-taxon model, nested studies into nine
taxonomic-habitat groupings that were further
nested within 48 biomes [defined by the
Ecoregions of the World datasets available from
The Nature Conservancy website; http://maps.tnc.org/gis_data.html,
TNC terrestrial regions
dataset (40-43)]; this resulted in 321 biome-taxon-study
combinations grouped within
105 specific biome-taxon combinations. The
48 biomes, including 33 marine (41), 10 terrestrial
(44), and 5 freshwater (42), represent geographic
regions of the world and allowed us
to characterize spatial patterns as biome-level
departures from the overall trend of biodiversity
change for each realm. The taxonomic-habitat
groupings, dictated by specifications in the original
studies, were amphibians, benthos, birds,
fish, invertebrates, mammals, marine invertebrates
and plants, plants, and multiple taxa,
and were included to contrast and control for
differences in trends among taxa within the
different biomes. We examined the robustness
of our biome-taxon models, and the spatial
patterns they identified, by fitting two complementary
hierarchical models with simpler geographic
structures. Here, we focus on the simplest
model (referred to as the realm-latitude-taxon
model) and present a model of intermediate
complexity in the supplement (38). All models
grouped cells within studies at the lowest level,
and the realm-latitude-taxon model grouped
studies into 29 specific combinations of realm
(marine, terrestrial, freshwater), latitude (polar,
temperate, tropical), and taxonomic-habitat
group, allowing us to characterize variation in
biodiversity change for taxon groups across
broad latitudinal bands within each realm.
The different geographic structure meant that
this model included 271 studies when we applied
our threshold of three cell-level time series
per realm-latitude-taxon group (38). Results of
all models were qualitatively consistent, both
in terms of the overall trends they estimated
and at the lowest levels (i.e., at the study and
cell levels; fig. S3), suggesting that our inferences
are largely robust to differences in how
we searched for geographic patterns. Additionally,
we found our results relatively insensitive
to the heterogeneity in temporal extent of the
data and did not detect systematic effects of the
number of years sampled, temporal duration,
start year, or the initial species richness on the
estimates of rates of change (figs. S7 and S8).

depicts the 90% credible interval) for the realm-latitude-taxon study model. (A) Density ridges of the taxon-level
slope coefficients (color represents the taxonomic group). (B) Density ridges of the posterior
distributions of the study-level slope coefficients within a given combination of realm and latitudinal band
estimated with the realm-latitude-taxon model.

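
The nesting described above (studies within taxon groups within biomes) can be illustrated with a small simulation in which each study's slope is the overall trend plus independent deviations drawn at each level. This is only a sketch: the default standard deviations are the values the article reports for the biome-taxon richness model (0.004, 0.003, and 0.04), whereas the function name, the balanced group sizes, and the Gaussian deviations are simplifying assumptions and not the authors' model specification, which is given in the supplement (38).

```python
import random


def simulate_study_slopes(n_biomes=48, taxa_per_biome=2, studies_per_taxon=3,
                          overall=0.0, sd_biome=0.004, sd_taxon=0.003,
                          sd_study=0.04, seed=1):
    """Draw study-level slopes from a three-level nested hierarchy."""
    rng = random.Random(seed)
    slopes = []
    for _ in range(n_biomes):
        biome_dev = rng.gauss(0.0, sd_biome)          # biome departure
        for _ in range(taxa_per_biome):
            taxon_dev = rng.gauss(0.0, sd_taxon)      # taxon-within-biome departure
            for _ in range(studies_per_taxon):
                study_dev = rng.gauss(0.0, sd_study)  # study-level departure
                slopes.append(overall + biome_dev + taxon_dev + study_dev)
    return slopes


slopes = simulate_study_slopes()
# With every standard deviation set to zero, all studies share the overall slope.
flat = simulate_study_slopes(n_biomes=2, taxa_per_biome=2, studies_per_taxon=2,
                             overall=0.01, sd_biome=0.0, sd_taxon=0.0,
                             sd_study=0.0)
```

Because the study-level standard deviation is roughly ten times the biome-level one, simulated slopes differ far more between studies than between biomes, which mirrors the pattern the article reports for richness trends.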
Our biome-taxon model results show that
variation in biodiversity change is greater in
the marine versus the terrestrial and freshwater
realms. The overall average of richness
change was not statistically distinguishable
from zero globally or for any individual biome
(Fig. 1). The magnitude of positive departures
from the overall trend was greater among marine
biomes (range of median biome departure:
-0.0003 to 0.001, n = 33; Fig. 1A) compared
with terrestrial and freshwater biomes (-0.0007
to 0.0001, n = 15; Fig. 1B), but richness trends did
not vary substantially among biomes (σ_biome =
0.004) or for taxon groups within biomes
(σ_biome-taxon = 0.003). Instead, the main level of
variation was at the study level (σ_biome-taxon-study =
0.04), where specific studies exhibited species
richness increases or decreases of up to 20%
per year in the marine realm and up to 10%
in the terrestrial realm (fig. S9). Twenty-three
marine, five terrestrial, and two freshwater
studies showed significant species richness
losses, whereas 31 marine and nine terrestrial
studies showed significant gains. These results
were consistent with the realm-latitude-taxon
model that showed change centered on zero
for all latitudinal bands (Fig. 2A), with the
greatest variation observed in the marine realm
at the study level, particularly in polar and tropical
latitudes (Fig. 2B). Data limitations from
tropical systems remain in our assemblage
time-series data (e.g., no tropical freshwater
assemblages), precluding some direct comparisons
between realms (see also fig. S1). The high
rates of change that we observed in the marine
tropics (Fig. 2B) are consistent with predictions
that tropical marine species will be relatively
sensitive to extreme heat events because
they are closer to their physiological limits
(20, 21), in addition to overexploitation, pollution,
and other threats occurring in the marine
tropics (36).

Fig. 3. Species turnover component maps showing departures from the
overall trend for marine and terrestrial biomes. Assemblages across the
globe are experiencing high rates of species replacement (median ~28% of
species replaced per decade). Shading represents positive (blue; faster turnover
than average) and negative (red; slower turnover than average) departures from
the overall trend for each biome; numbers in the inset denote the departure
and the biome-level (overall + departure) estimate in parentheses. (A) Rates of
arriving species replacing original species have both faster (blue) and slower
rates of turnover (red) from the overall trend in marine biomes, but included
the biomes with the highest turnover rates: the 90% credible intervals in the
warm temperate northwestern Atlantic, warm temperate southwestern Atlantic,
and the northwest Australian shelf biomes were greater than the overall trend,
whereas (B) terrestrial and freshwater biomes have mostly slower rates of
turnover than the overall trend (red shading), and the 90% credible intervals for
temperate broadleaf and mixed forests, temperate conifer forests, and tropical
and subtropical moist broadleaf forests were lower than the overall trend.

To examine changes in species composition,
we partitioned total Jaccard dissimilarity,
calculated as the dissimilarity between the initial
year and each subsequent year of a time
series, into the additive components of turnover
and nestedness (45). These trends describe
directional compositional change relative to
the initial assemblage, and the decomposition
determines whether changes in community
composition were caused by the original species
in assemblages being replaced by other species
(turnover) or by assemblages becoming
smaller subsets of themselves or growing to
include additional species alongside the original
species (nestedness). Overall, we found
that rates of turnover were positive and much
greater (0.028; 90% credible interval: 0.023
to 0.032; Fig. 3) than the rates of change in
nestedness (0.006; 0.006 to 0.007; fig. S12).
Compositional change was dominated by species
replacement within assemblages, with
~28% of species being replaced per decade.
Variation at the biome level was much greater
for turnover (σ_biome = 0.01) compared with
species richness, resulting in stronger geographic
patterns and revealing further differences
between marine and terrestrial realms
(Fig. 4). Three marine biomes (warm temperate
northwest and southwest Atlantic and the
northwest Australian shelf) had rates of turnover
greater than the overall trend. By contrast,
three terrestrial biomes (temperate
broadleaf and mixed forests, temperate conifer
forests, and tropical and subtropical moist
broadleaf forests) had rates of turnover slower
than the global trend, and most terrestrial
biomes showed negative departures from the
global average (Fig. 3B). Positive departures
from the overall trend in terrestrial and freshwater
biomes were found in aquatic systems:
large lakes, mangroves, and polar freshwaters.
These trends of directional compositional
change are highly unlikely to have arisen simply
from random assemblages being drawn
from relatively constant regional species pools.
Simulations show that such a process has a
median slope of zero for both turnover and
nestedness change (38) (fig. S10). Additionally,
we found that higher rates of compositional
change in marine and freshwater biomes
were associated with a higher proportion of
assemblages undergoing complete turnover
(fig. S13) and were robust to our choice of error
distribution (fig. S14) and whether comparisons
were made with the initial assemblage
or between assemblages at consecutive time
points (38) (figs. S15 and S16).

Fig. 4. Posterior distributions of turnover for the realm-latitude-taxon model. The overall trend in
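
The partition of Jaccard dissimilarity into turnover and nestedness described above can be sketched in a few lines of Python. The article cites (45) for the method without writing out the formulas, so the expressions below are an assumption: they follow a widely used formulation in which a is the number of species shared by the two years and b and c are the numbers unique to the initial and later year. The function name and the example species lists are hypothetical.

```python
def jaccard_partition(initial, later):
    """Split Jaccard dissimilarity into turnover and nestedness components.

    Returns (total, turnover, nestedness) with total = turnover + nestedness.
    """
    initial, later = set(initial), set(later)
    a = len(initial & later)   # species present in both years
    b = len(initial - later)   # species only in the initial year
    c = len(later - initial)   # species only in the later year
    if a + b + c == 0:
        raise ValueError("both assemblages are empty")
    total = (b + c) / (a + b + c)
    denom = a + 2 * min(b, c)
    # Turnover counts only replacement: species lost AND matched by gains.
    turnover = 2 * min(b, c) / denom if denom else 0.0
    nestedness = total - turnover
    return total, turnover, nestedness


# One species replaced (E for C or D) plus a net loss of one species.
mixed = jaccard_partition({"A", "B", "C", "D"}, {"A", "B", "E"})
# Pure subset: the later assemblage only lost species, so turnover is zero.
subset = jaccard_partition({"A", "B", "C", "D"}, {"A", "B"})
```

In the first example the total dissimilarity of 0.6 splits into 0.5 turnover and 0.1 nestedness; in the second, all of the 0.5 dissimilarity is nestedness because no new species arrived.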


Linking richness and composition change 


To examine the relationship between changes 
in species richness and changes in composition, 
we plotted the dominant component of com- 
position change (turnover or nestedness) for 
each biome-taxon-study combination against 
species richness change (46) (Fig. 5, A and B). 
When turnover is the dominant component, 
this relationship shows how quickly different 
species are replacing original species and whether 
these arrivals are associated with changes to 
the number of species. At the study level, rates 
of turnover exceeded nestedness change for 
>97% of biome-taxon-study combinations 
(313/321; Fig. 5, B and C). Among these studies, 
~18% (57/313) exhibited trends different from
zero for both turnover and species richness 
rates (Fig. 5C), with a relatively balanced dis- 
tribution of 23 cases of species richness losses 
and 34 cases with gains. We did not adjust for 
multiple comparisons, though they are less of 
a problem when comparing partially pooled 
estimates from hierarchical models (47). When 
nestedness is the dominant component, this 
relationship shows how fast assemblages are 
changing to become smaller subsets or grow- 
ing to include additional species alongside the 
species initially observed. Among the eight 
biome-taxon-study combinations where nest- 
edness exceeded turnover change (8/321; Fig. 
5, B and C), only two showed rates of nested- 
ness and richness trends different from zero, 
with one losing and one gaining species. Our 
combined results for turnover and species 
richness change support recent studies re- 
porting that different components of bio- 
diversity change, such as composition shifts 
and species richness, are largely uncoupled 
(5, 7-9). In fact, we found that high rates of 
turnover were associated with the full spec- 
trum of richness changes. 
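
To make the distinction between turnover and nestedness concrete, the sketch below splits the Jaccard dissimilarity between two species lists into the two components. It assumes the commonly used additive partition of Jaccard dissimilarity (total = turnover + nestedness); the function name and the toy species lists are hypothetical and are not taken from the analysis reported here.

```python
def jaccard_partition(initial, later):
    """Return (total, turnover, nestedness, richness_change) for two assemblages."""
    initial, later = set(initial), set(later)
    a = len(initial & later)   # species present at both time points
    b = len(initial - later)   # species lost
    c = len(later - initial)   # species gained
    if a + b + c == 0:
        raise ValueError("both assemblages are empty")
    total = (b + c) / (a + b + c)            # Jaccard dissimilarity
    lo = min(b, c)
    # Turnover: replacement of species, independent of richness differences.
    turnover = 2 * lo / (a + 2 * lo) if (a + 2 * lo) > 0 else 0.0
    # Nestedness: the remainder, driven by net gain or loss of species.
    nestedness = total - turnover
    richness_change = len(later) - len(initial)
    return total, turnover, nestedness, richness_change


# Two species replaced by two others: pure turnover, no richness change.
print(jaccard_partition("ABCD", "ABEF"))
# Two species lost with no arrivals: pure nestedness, richness declines.
print(jaccard_partition("ABCD", "AB"))
```

The first toy case illustrates how an assemblage can change substantially in composition while its species richness stays constant, which is the uncoupling described above.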


Discussion 


Compositional change dominated by species 
turnover is the most conspicuous and preva- 
lent form of biodiversity change across the globe 
and was characterized by strong geographic 
structure. Only marine biomes were found to 
have faster rates of compositional change than 
the overall trend, whereas only terrestrial biomes 
were observed to trail the overall compositional 
trend. Moreover, marine studies exhibited greater 
variation in rates of compositional turnover.
We also found that most studies, across all 
biomes and realms, showed considerable re- 
placement of species through time without 
associated species richness changes. This 
finding, using the largest global dataset to 
date, is consistent with the assertion that 
species richness trends are often uncoupled 
from species replacement, and thus are insufficient
alone for fully capturing how biodiversity
might change (9). The consistent
pattern of species replacement is likely underpinned
by a diverse suite of drivers affecting
different study sites, regions, and realms. Such 
reorganization independent of changes to the 
number of species is also consistent with the 
presence of regulatory mechanisms for spe- 
cies richness. Community regulation of spe- 
cies richness is widespread (48) and may be 
driven, for example, by shared resources (24). 
Contemporary pressures such as introduced 
species (29-31), replacement of localized specialists
by widespread generalists (49, 50), range
shifts in response to environmental change 
(22, 51), or local warming (52, 53) may also 
help to explain our finding of widespread com- 
position change associated with variable rich- 
ness change at the study level. 

Rates of species richness change and turn- 
over were higher in absolute magnitude and 
more variable in the marine realm, with maxi- 
mum turnover rates in marine biomes twice 
those observed for terrestrial biomes. Higher 
rates of turnover in the marine realm are con- 
sistent with predictions for species responses 
on the basis of greater sensitivities to increased 
temperatures (20, 21) coupled with fewer bar- 
riers to dispersal in marine systems (22, 23), 
though attribution to specific drivers is be- 
yond the scope of analyses presented here. 
Further, if assemblages are more spatially het- 
erogeneous in marine compared with terres- 
trial systems, then this too may contribute to 
our finding of higher temporal turnover in 
marine assemblages (54). Although we did 
not find strong contrasting trends for spe- 
cific taxonomic groups within biomes, envi- 
ronmental differences between the realms 
(unrelated to dispersal and connectivity) or 
life history differences among taxa could also 
underpin some of the patterns in turnover 
detected at the realm and regional scales. 
For example, in the temperate marine realm, 
mammals had lower turnover than inverte- 
brates, which is consistent with other findings 
showing that long-lived taxa exhibit less rapid 
temporal turnover (54). 

Amid widespread variation in biodiversity 
trends, we found that tropical marine regions 
have a higher proportion of studies exhibiting 
biodiversity change at the extremes of rich- 
ness gains, losses, and turnover (Figs. 2B 
and 4 and fig. S9). Hence, although we find 
higher magnitude changes in the tropics, this 
result contrasts with our prediction that we 
would find mostly richness losses. The tropics, 
which harbor most of the biological diversity 
on the planet, are also generally considered 
to be the place where biodiversity is the most 
threatened (36). Moreover, in the context of 
climate change, there are likely fewer spe- 
cies available to replace those species lost in 
tropical zones that have entered no-analog 
warm-temperature conditions (55, 56). If these 
trends are maintained, then this could lead to 
a marked restructuring of biodiversity, with
potentially severe consequences for ecosys- 
tem functioning across biomes and changes 
to the latitudinal diversity gradient, substan- 
tially altering the planet’s biogeography. How- 
ever, such a pattern of biotic attrition would 
be expected to be accompanied by a larger 
contribution of the nestedness component 
to community compositional change than 
we found here (Fig. 5 and fig. S12). BioTIME 
includes relatively few tropical datasets (37), 
despite being the largest compilation of biodiversity
time series currently available, and
further data collection in these areas is needed 
to be able to confidently assess trends in the 
tropics. Furthermore, biodiversity monitoring 
overall is lacking for many regions of the 
planet, e.g., the deep ocean. The geographical 
variation that we uncovered highlights the 
critical importance of improving the spatial 
coverage of biodiversity monitoring to better 
estimate global biodiversity change. 

We identified hotspots of biodiversity change, 
that is, key areas that represent extremes for 
biodiversity trends. As conservation has moved 
toward systematically identifying regions in 
need of protection (57), global maps of con- 
servation priorities according to different crite- 
ria have been developed (16, 58). Our study 
provides an important criterion for targeting 
conservation action: a global map of current 
rates of biodiversity change. In addition to the 
marine tropics, marine biomes in the western 
Atlantic and northwest Australia are under- 
going rates of replacement higher than the 
global average. Therefore, these regions are 
currently undergoing the most substantial 
rates of change and should be prioritized 
for reactive conservation measures. By con- 
trast, several forest biomes (e.g., temperate 
broadleaf and mixed, temperate conifer) have 
slower rates of replacement than the global 
average. Because these regions appear to be 
undergoing less change, we speculate that pro- 
active conservation measures are likely to be 
more appropriate. Specific conservation actions 
always need to be tailored to the locations and 
taxa, but our study provides the global and 
regional context in which individual locations 
are immersed. 

The global heterogeneity in biodiversity 
change is underpinned by geographic variation. 
We find that spatial variation in species gains 
and losses is greater than taxonomic variation. 
This spatial variation suggests that statements 
about biodiversity loss need to be conditional 
on context and location. On average, local 
species richness change across the globe does 
not differ from zero, but there are many loca- 
tions gaining or losing species. Species re- 
placement is ubiquitous and also spatially 
structured. Determining whether the spa- 
tial variation uncovered is related to differ- 
ences among communities in their degree of 
exposure and vulnerability to drivers of bio- 
diversity change is an important next step. 
Nevertheless, our results show that, although 
the entire planet is undergoing biodiversity 
change, the direction and magnitude of change 
differs across geographic regions. 
Hierarchical interlocked orthogonal faulting in the
2019 Ridgecrest earthquake sequence 
A nearly 20-year hiatus in major seismic activity in southern California ended on 4 July 2019 with a
sequence of intersecting earthquakes near the city of Ridgecrest, California. This sequence included
a foreshock with a moment magnitude (Mw) of 6.4 followed by an Mw 7.1 mainshock nearly 34 hours later.
Geodetic, seismic, and seismicity data provided an integrative view of this sequence, which ruptured 
an unmapped multiscale network of interlaced orthogonal faults. This complex fault geometry persists 
over the entire seismogenic depth range. The rupture of the mainshock terminated only a few kilometers 
from the major regional Garlock fault, triggering shallow creep and a substantial earthquake swarm. 
The repeated occurrence of multifault ruptures, as revealed by modern instrumentation and analysis 
techniques, poses a formidable challenge in quantifying regional seismic hazards. 


Over the past three decades, an increasing
number of well-documented earthquakes
have ruptured multiple faults
(1-5), highlighting the importance of 
incorporating this geometric complexity 
into models of seismic hazard. On 4 July 2019, 
a sequence of damaging earthquakes began 
near the city of Ridgecrest, California, and 
activated a complex fault network, further il- 
lustrating the need to understand how multiple 
faults can rupture in a single earthquake. 

The Ridgecrest sequence included a foreshock
with a moment magnitude (Mw) of 6.4 followed
by an Mw 7.1 mainshock nearly 34 hours
later (Fig. 1). These dominantly strike-slip earthquakes
occurred on largely unmapped faults
that cumulatively extend more than 75 km 
in length. Shaking from these events was felt 
throughout portions of California, Nevada, and 
even Arizona. Damage was primarily concen- 
trated in the towns of Ridgecrest and Trona. 
The severity of damage was limited, as the area 
is sparsely populated. However, substantial 
portions of the ruptures occurred within the 
boundaries of the Naval Air Weapons Station 
at China Lake and caused major damage to 
facilities there. 

Seismic activity occurred within the Little 
Lake fault zone (LLFZ) and nearby Airport Lake 
fault zone, both of which have a long history of 
activity (Fig. 1), including multiple sequences 
with Mw of >5 between 1982 and 1996 (6). The
LLFZ is accumulating right-lateral strain of 
about 1 mm/year (7) and is bounded to the 
southeast by the Garlock fault, a 260-km-long 
left-lateral strike-slip fault capable of producing
Mw ~7.8 earthquakes (8). The Coso geothermal
area abuts the LLFZ to the northwest
and is the site of one of the largest geothermal 
power plants in the United States. Numerous
earthquake swarms have occurred in this area 
(9). To the northwest of Coso is the southern 
terminus of the 1872 Mw 7.5 Owens Valley earthquake
(10) (Fig. 1), which is among the largest
historical earthquakes in California and was 
responsible for 27 deaths. 

In the time since the last major earthquake 
in California, the quality of seismic and geodetic 
data available has improved substantially. The 
Southern California Seismic Network has nearly 
doubled in size, Global Positioning System (GPS) 
stations have been installed throughout the 
region, and interferometric synthetic aperture 
radar (InSAR) data are available at unprece- 
dented levels of quality with shorter times 
between acquisitions. Here, we analyze these 
datasets with state-of-the-art techniques, includ- 
ing a high-resolution seismicity catalog derived 
from template matching, detailed multifault 
inversions of geodetic data with nonplanar ge- 
ometry, and subevent modeling of the largest 
events in the sequence. The observations indi- 
cate that the Ridgecrest sequence ruptured an 
astonishingly complex network of intertwined 
orthogonal faults operating over a broad range 
of length scales. They also show that the Garlock 
fault experienced shallow triggered postseismic 
creep and a large earthquake swarm; the role 
this major tectonic structure plays in limiting 
the southern extent of faulting and seismicity 
is yet to be understood. 


The 2019 Ridgecrest sequence 


The Ridgecrest mainshock was preceded by a 
prominent foreshock sequence that began with 
numerous events of local magnitude (ML) ~0
(Fig. 1). The seismicity rates escalated rapidly,
with an Mw 4 event 2 hours later and an Mw
6.4 earthquake shortly thereafter. Over the first
21 days of the sequence, more than 111,000 
earthquakes with M of >0.5 occurred, including
70 events with an M of >4.

To resolve the geometry of the fault zone
that ruptured during the Ridgecrest sequence, 
we produced a comprehensive relocated seis- 
micity catalog with template matching that had 
nominal relative resolution of ~100 m horizontally
and 350 m vertically (11) (Fig. 2). We paired
this catalog with a damage proxy map (DPM) 
(11) (Fig. 3), extracted from satellite InSAR 
coherence data, that identifies regions where 
radar-scattering properties of the ground have 
changed substantially relative to their inher- 
ent background rates of change (12). The fault 
zone exhibits pervasive orthogonal faulting over 
multiple length scales with notable geometric 
complexity (Figs. 2 and 3). The largest scale is 
for a roughly 55-km-long northwest-striking 
structure with an undulating geometry that is 
orthogonally cross-cut by a ~15-km-long fault. 
The larger of these was the primary structure 
that ruptured during the Mw 7.1 earthquake,
while the smaller structure was the largest that
ruptured during the Mw 6.4 event (Fig. 3).
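
As a rough illustration of the template-matching idea behind such a catalog, the sketch below slides a known event waveform along a continuous trace and flags windows whose normalized cross-correlation exceeds a threshold. It is a toy, single-channel example under simplifying assumptions (no noise, no stacking across stations) and is not the processing pipeline used to build the catalog; the function names, the threshold of 0.8, and the synthetic data are all hypothetical.

```python
import math


def normalized_cross_correlation(template, trace):
    """Correlation coefficient between the template and every window of the trace."""
    n = len(template)
    t_mean = sum(template) / n
    t_dev = [x - t_mean for x in template]
    t_norm = math.sqrt(sum(d * d for d in t_dev))
    cc = []
    for start in range(len(trace) - n + 1):
        window = trace[start:start + n]
        w_mean = sum(window) / n
        w_dev = [x - w_mean for x in window]
        w_norm = math.sqrt(sum(d * d for d in w_dev))
        if t_norm == 0 or w_norm == 0:
            cc.append(0.0)  # flat window: correlation is undefined, treat as no match
        else:
            cc.append(sum(p * q for p, q in zip(t_dev, w_dev)) / (t_norm * w_norm))
    return cc


def detect(template, trace, threshold=0.8):
    """Indices of trace windows that resemble the template."""
    cc = normalized_cross_correlation(template, trace)
    return [i for i, value in enumerate(cc) if value >= threshold]


# Synthetic trace: a copy of the template, ten times smaller, hidden at sample 10.
template = [0.0, 1.0, -1.0, 0.5, 0.0]
trace = [0.0] * 20
for k, value in enumerate(template):
    trace[10 + k] = 0.1 * value

print(detect(template, trace))
```

Because the correlation is normalized, the small-amplitude copy is detected as readily as a large one, which is why this class of methods can recover events far below the magnitude threshold of a routine catalog.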

Around the northwest rupture terminus, the 
main fault dips steeply to the southwest before 
transitioning to a northeast dip of roughly 70° 
on the segment of the fault southeast of the 
Mw 7.1 hypocenter. Toward the southeast part
of the rupture, the largest fault bifurcates into 
two subparallel strands that are separated by 
7 km and that continue for about 12 km each 
(Figs. 2 and 3). Near the main intersection of 
the Mw 7.1 and Mw 6.4 ruptures, there are at
least 20 orthogonal faults that appear as linea- 
tions in the seismicity and features in the DPM. 
The lineations often localize to a width of roughly 
100 to 200 m, which is comparable to the un- 
certainty of the hypocenter locations (11). The 
general absence of orthogonal faulting in the 
northwest section of the rupture may indicate 
that this part of the fault zone is slightly further 
along in the evolutionary process. Numerous 
horsetail structures are oriented at an oblique 
angle to the main rupture trace at the northwest 
rupture terminus. 

Nearly all of the geometric features de- 
scribed are present in the seismicity as well 
as the InSAR surface deformation, indicating 
that they are representative of the fault struc- 
ture over the entire seismogenic zone. As defined 
by seismicity, the bottom of the seismogenic 
zone lies around 10 km at its deepest extent, 
with the maximum depth slowly decreasing to 
about 5 km toward the southeast (Fig. 2). Such 
a shallow seismogenic zone is also present in 
the Salton Trough region of California and is 
often attributed to the elevated heat flow (13); 
similar heat flow anomalies exist near the 
Ridgecrest sequence (14) and may control
the boundaries of the brittle-ductile transi- 
tion zone. 

On the basis of kinematic subevent inver- 
sion of seismograms from the dense regional 
seismic network and global seismic stations, 
the Mw 6.4 earthquake had a duration of about
12 s, with three subevents best explaining the
data (Fig. 4). These three subevents coincide with 
at least three faults. The 6-km-long northwest- 
trending fault slipped first, with an equivalent 
Mw 6.1 (subevent E1). This is consistent with
the hypocenter of this event, which is located
about 2 km northwest of the long southwest-trending
fault. Thus, a rupture propagated over
a short southwest-trending fault with only about
5 km of surface break, yet the equivalent magnitude
for this segment was the Mw 6.2 subevent
(E2). Our analysis of this subevent indicates
that it is required by the data and that the 
magnitude is well resolved (figs. S1 to S4). 
However, this subevent may also include slip 
on an adjacent northwest-trending orthogonal 
fault. Because the seismicity on the E2 segment 
extends across the main fault of the main- 
shock (Fig. 2), part of the slip probably oc- 
curred on the other side, but only at depth. 
The rupture then jumped to a larger south- 
west-trending fault that is about 15 km long, 
producing again an equivalent seismic moment
to an Mw 6.2 earthquake (E3) and a large
surface rupture. 
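
The relation between the subevent magnitudes and the magnitude of the whole event can be checked by summing seismic moments rather than magnitudes. The sketch below assumes the standard moment magnitude definition, Mw = (2/3)(log10 M0 - 9.1) with M0 in newton-meters; the function names are illustrative. With the three subevent magnitudes quoted above, the summed moment corresponds to a magnitude of roughly 6.5, close to the reported 6.4 given that the subevent magnitudes are rounded to one decimal place.

```python
import math


def moment_from_mw(mw):
    """Seismic moment in N·m for a given moment magnitude."""
    return 10 ** (1.5 * mw + 9.1)


def mw_from_moment(m0):
    """Moment magnitude for a given seismic moment in N·m."""
    return (math.log10(m0) - 9.1) / 1.5


def combined_mw(subevent_magnitudes):
    """Magnitude of an event whose moment is the sum of its subevents' moments."""
    total_moment = sum(moment_from_mw(mw) for mw in subevent_magnitudes)
    return mw_from_moment(total_moment)


# Subevents E1, E2, and E3 of the foreshock as quoted in the text.
print(round(combined_mw([6.1, 6.2, 6.2]), 2))
```

Because magnitude is logarithmic in moment, three subevents of similar size raise the total by only about 0.3 magnitude units over any one of them.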

Additional insights into the rupture process 
of the Mw 6.4 earthquake are given by a joint
inversion of the geodetic data spanning both 
events and constraints provided by the relo- 
cated seismicity (Fig. 5 and fig. S10). On the 
faults closest to the three kinematic subevents, 
there is appreciable slip in our model. This event 
nucleated at the intersection between two faults 
trending northwest and southwest (Fig. 5). The 
15-km-long southwest-oriented left-lateral fault 
that aligns with subevent E3 (Fig. 4) produced 
a maximum of about 5 m slip. The large slip 
patches on these faults are coincident with re- 
gions of decreased aftershock density (fig. S10). 

Our kinematic subevent model of the Mw
7.1 mainshock shows that the rupture lasted 
for about 22 s and can be explained by four 
subevents (Fig. 4). The initial 5 s of the rupture 
had little moment, whereas the subevent with 
the largest moment (subevent E1, Mw 6.9) occurred
close to the hypocenter between 5 and
10 s. This subevent appears to have ruptured 
bilaterally, given its large seismic moment. 
Centroid locations of the later subevents propa- 
gated very slowly to the southeast, suggesting 
a unilateral rupture. The final subevent, E4 (at 
20 s), occurred ~25 km southeast of the first 
subevent E1 (at 7 s), indicating a slow average 
rupture velocity of roughly 2 km/s. 
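
The average rupture velocity quoted here follows directly from the separation of the subevent centroids divided by the difference in their times. The sketch below reproduces that arithmetic with the values stated in the text; the function name is illustrative, and the calculation assumes straight-line propagation between the two centroids.

```python
def average_rupture_velocity(distance_km, t_start_s, t_end_s):
    """Average propagation speed between two subevent centroids, in km/s."""
    elapsed = t_end_s - t_start_s
    if elapsed <= 0:
        raise ValueError("the later subevent must occur after the earlier one")
    return distance_km / elapsed


# E1 at about 7 s and E4 at about 20 s, roughly 25 km apart.
velocity = average_rupture_velocity(25.0, 7.0, 20.0)
print(f"{velocity:.1f} km/s")
```

The result, about 1.9 km/s, rounds to the roughly 2 km/s given above.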

The Mw 7.1 earthquake nucleated about 10 km
to the northwest of the Mw 6.4 event and ruptured
a fault network with a cumulative length
of about 65 km. Most of the slip occurred near 
the hypocenter (Fig. 5), where an ~30-km-long 
patch produced a maximum of about 9 m slip 
at the scales resolved by our model. This part 
of the rupture is notably deficient in after- 
shocks relative to the rest of the rupture (fig. 
S10), a feature that is common to large slip 
Fig. 1. Overview of study area. Mapped faults are indicated in brown. Ridgecrest sequence epicenters are
shown as blue dots for the time period up to the Mw 7.1 mainshock and as red dots following the mainshock.
Santa Barbara, Los Angeles, Ventura, Palm Springs, and Ridgecrest and the LLFZ are denoted SB, LA, V, PS,
RC, and LLFZ, respectively. Inset shows magnitude-time evolution of the first three days of the sequence.
Note the catalog incompleteness during the first few days after the largest events. The Mw 6.4 event was
preceded by a prominent foreshock sequence.


regions of many other earthquakes (15). Slip in 
the model has a maximum depth that is close 
to that of the seismicity (~10 km). We found 
that both strands of the bifurcation ruptured 
for about 12 km, terminating less than 5 km 
from the Garlock fault. These observations agree 
with the extent of seismicity. There is evidence 
from the seismicity for one or more additional 
subparallel faults on the southwest side of these 
two (Fig. 2). 

The faults that ruptured during the Mw 7.1
event appear to be distinct from those that
produced the Mw 6.4 event, which likely indicates
that no fault ruptured the same area
twice. The initial portion of the Mw 6.4 rupture
propagated to the northwest about 6 km and
terminated about 4 km from the eventual Mw
7.1 hypocenter (Fig. 4 and fig. S7). This 4-km gap
was progressively filled by a series of moderate-sized
earthquakes in the 34 hours after the Mw
6.4 event, which suggests that this portion of
the fault acted as a barrier through which the
Mw 6.4 rupture was unable to propagate (Fig.
4 and fig. S7). However, from the DPM (Fig.
3B), it is clear that this 4-km-long segment
did rupture the surface at some point during
the sequence, which most likely occurred during
the Mw 7.1 event. The Mw 7.1 event appears to
have been triggered because steady seismic activity
eroded away this barrier. Furthermore,
from our modeling of the source process of the
Mw 6.4 event, this foreshock either jumped
across the primary fault that failed during the
Mw 7.1 event or ruptured through it in an orthogonal
direction, yet it was somehow unable to
trigger this large fault at that time. Instead, the
Mw 7.1 event nucleated more than 10 km away
from this intersection after the fault was weakened by
a series of Mw >4 earthquakes over 34 hours.
The dynamic rupture conditions that could allow 
for such a scenario are not clear. 

In addition to direct aftershocks that occurred 
in the LLFZ, the Ridgecrest sequence initiated 
three sizable earthquake swarms. One swarm 
was activated on the Garlock fault by the Mw
6.4 event and is located in a pull-apart structure
about 50 km southwest of the Ridgecrest sequence
(fig. S9). More than 4000 events with
an M of >0 occurred in this swarm during the
first 3 weeks, the largest of which was ML 3.2
(fig. S9). This area has seen only minor seismic 
activity over the past several decades. Another 
swarm began north of the Coso geothermal 
field (Fig. 1), where similar behavior was ob- 
served following the Landers earthquake (6), 
with a 15-km gap over the geothermal produc- 
tion area (17). A third swarm was triggered in 
Panamint Valley. 

On the section of the Garlock fault south of 
the rupture terminus of the Ridgecrest mainshock,
InSAR data reveal an ~30-km zone of
left-lateral, triggered, shallow creep (Fig. 6). 
The largest surface offset is around 20 mm of 
relative motion in the satellite line-of-sight 
direction, directly on the bearing of the Mw 7.1
rupture. The narrow extent of the deformation 
field suggests that the creep is confined to the 
upper few hundred meters. This creep is sim- 
ilar to shallow creep induced by other major 
earthquakes (18). Although the Garlock fault 
has been seismically quiescent during the his- 
toric period, it has hosted numerous large 
earthquakes during the previous several thou- 
sand years (19). However, previous geodetic 
measurements have shown no measurable 
creep on the Garlock fault (20). 
Fig. 2. Map view of Ridgecrest seismicity. Black lines indicate the surface trace of the fault (31), and
purple lines indicate Quaternary faults. Events with Mw of >4.5 are indicated by focal mechanisms (32).
The fault network exhibits multiscale orthogonal faulting throughout the region, with a bifurcation to
the southeast and horsetail faulting at the northwest terminus. The lower panel shows a seismicity cross
section for events within 1 km of A-A' with interpreted faults shown as brown lines. At least 20 orthogonal
faults cut through this profile. The dashed red line indicates the surface trace of the southwest-trending
fault that ruptured in the Mw 6.4 event.

Discussion

Most of our knowledge about the structural 
architecture of fault zones comes from obser- 
vations made on the surface, which consist 
primarily of geological mapping of faults and 
geodetic observations of coseismic deforma- 
tion. At depth, our understanding of proper- 
ties such as the geometry of fault zones is far 
less complete, with most evidence suggest- 
ing a general tendency for structural com- 
plications like damage zones to localize and 
simplify with depth (21). For the Ridgecrest
earthquakes, we find that nearly every aspect 
of the surface geometry persists at depth, 
including the bifurcation of the Mw 7.1 rupture
to the southeast, the horsetail faulting
at the northwest terminus, the multiple segments
that ruptured during the Mw 6.4 rupture,
and the rampant orthogonal faulting.
Below a 6-km depth, the main faults that ruptured
during the Mw 7.1 event may become
slightly more planar. The smaller orthogonal 
faults produced events primarily with M of 
<3, whereas events with M of >3 are mostly 
associated with the largest structures (fig. 
S15). These results show that precisely re- 
located seismicity catalogs are of sufficient 
resolution to constrain fault geometry and even 
some aspects of the rupture process at depth. 

Orthogonal faulting occurring over a large 
region, at multiple length scales, implies that 
this is a primary characteristic of this fault 
zone. The smallest of these structures, having a 
length of 1 km or less, are probably being slowly 
generated as part of the ongoing formation 
of this young fault zone. Assuming a Mohr- 
Coulomb failure criterion, if two orthogonal 
planes are equally favorable, then the coeffi- 
cient of friction must be close to zero, imply- 
ing that the rocks are held together by cohesive 
strength alone (22). Alternatively, numerical 
studies have shown that cross-faults can be 
produced as a dynamic rupture effect near 
rupture edges (23). If a fault zone has widespread
cross-faulting, the geometry could have a major 
influence on the earthquake rupture process, 
because at any point in time there will be many 
potential pathways to sustain the rupture prop- 
agation, thereby facilitating the occurrence of 
multifault ruptures. 
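
The friction argument in this paragraph can be checked with a short calculation. Under a Mohr-Coulomb criterion, the two conjugate failure planes lie at 45° - φ/2 on either side of the maximum compressive stress, where φ = arctan(μ) is the friction angle, so the angle between them is 90° - φ. The sketch below evaluates that relation for a few friction coefficients; the function names and the example values of μ are illustrative assumptions.

```python
import math


def conjugate_angle_deg(mu):
    """Angle in degrees between the two optimally oriented Mohr-Coulomb planes."""
    friction_angle = math.degrees(math.atan(mu))
    return 90.0 - friction_angle


def friction_for_conjugate_angle(angle_deg):
    """Friction coefficient implied by an observed angle between conjugate faults."""
    return math.tan(math.radians(90.0 - angle_deg))


for mu in (0.0, 0.2, 0.6):
    print(f"mu = {mu:.1f}: conjugate planes {conjugate_angle_deg(mu):.1f} degrees apart")
```

A typical laboratory friction coefficient near 0.6 gives conjugate planes about 59° apart, whereas planes 90° apart require μ close to zero, which is the inference drawn in the text.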

The Ridgecrest sequence has many similar- 
ities with the 1987 Superstition Hills sequence 
(24), which also exhibited prominent orthogonal 
faulting and had an Mw 6.2 foreshock followed
by an Mw 6.6 mainshock. In that sequence, the
two largest events formed an L-shaped geometry,
similar to the faults that ruptured during
the Ridgecrest Mw 6.4 event. More generally,
however, orthogonal faulting is present across 
much of southern California as well as Japan 
(25-28) and was also prominent in the 2012 
Indian Ocean sequence (29). 

The highly segmented nature of the Ridge- 
crest earthquakes suggests that the rupture 
process during these events could be more of 
a cascading phenomenon than a single contin- 
uous rupture front propagating along a fault. 
This behavior would depend at least partly on 
whether the faults are physically connected 
at depth. In a cascading rupture process, esti- 
mates of the average rupture velocity could be 
biased because the time of slip initiation for 
each segment is not a smoothly varying func- 
tion of space and time. More important, how- 
ever, is that the physics of the rupture process 
is entirely different in a cascading model, with 
ruptures potentially being modulated or driven 
by the seismic waves. 

In the Ridgecrest sequence, at least 20 faults 
with a length of ~2 km or larger ruptured, with 
Fig. 3. Interferogram and DPM. (A) Coseismic interferogram derived from the ALOS-2 SAR image pair
(2018-04-16 and 2019-07-08), showing the locations of surface ruptures. The line of sight (LOS) is from
ground to satellite. (B) DPM derived from coherence loss between pre- and postseismic Sentinel-1 SAR data.
Fig. 4. Kinematic summary of rupture processes. The Mw 6.4 foreshock ruptured three main faults.
This event was followed by foreshock activity along a northwest-trending fault and eventually triggered the
Mw 7.1 event. The mainshock had four main subevents and ruptured bilaterally. Both events have very slow
rupture velocities of ~2 km/s.


countless smaller faults that likely slipped too. 
Many of these ruptured the surface, while others 
are only visible at depth. The Ridgecrest earth- 
quakes are the latest large crustal earthquakes 
to exhibit multifault ruptures (1-5), which


18 October 2019 


several decades ago were viewed as unlikely 
or outright impossible (1). Today, with the
available resolution, this phenomenon appears 
to be more common than not. The persistence
of multifault ruptures indicates that the
phenomenon is a critical element of the physics
of earthquakes.

Furthermore, the regular occurrence of multi- 
fault ruptures has fundamental implications 
for seismic hazard assessment. Accounting for
all possible combinations of faults that could 
activate simultaneously during a single event 
or sequence is challenging, if not functionally 
impossible. This task is made even more difficult
by the fact that, as demonstrated by the
Ridgecrest sequence, our database of large faults 
in California is still incomplete, and the concept 
Fig. 5. Static inversion of geodetic data. (A) Mean slip model of the Ridgecrest sequence, including the slip contribution from the Mw 6.4 and Mw 7.1 events.
(B) Black lines show the fault geometries used in the slip model. Arrows indicate the data (black) and model prediction (red) of nearby GPS offsets. The color map indicates
the displacements in the unwrapped ALOS-2 ascending track 65 coseismic interferogram along the corresponding line-of-sight (LOS) direction (black arrow). Model
predictions and data residuals are provided in the supplementary materials (11).


Fig. 6. Summary of triggered creep on the Garlock fault. Step function fit at the coseismic time for
Sentinel-1, ascending track 64 InSAR time series, combined with gradient shading of the same field.
The gradient shading reveals zones of surface offset along the Garlock fault. Fault-perpendicular
profiles reveal offset of up to 20 mm in the line-of-sight direction, while the width of the
deformation profiles suggests that offset on the fault is confined to shallow depths.
of how faults are defined should be revisited.
Magnitude estimates of historical earthquakes 
from paleoseismology could therefore be biased 
downward for many such multifault ruptures, 
while also exposing the potential for incorrectly 
splitting a single complex rupture into multiple 
smaller ones. As Fig. 1 shows, although the San 
Andreas fault may be the single largest source 
of hazard in southern California, nearly all large 
earthquakes since the 1906 San Francisco earth- 
quake, over a century ago, have not occurred 
on the San Andreas fault, and many have been 
complex multifault ruptures. 


Conclusions 


The 2019 Ridgecrest sequence brought to an 
end the long earthquake silence in California. 
These events occurred within an immature fault 
zone and activated many orthogonal structures 
with lengths ranging from 1 km to more than 
10 km. The largest events each ruptured mul- 
tiple faults, a characteristic that has been re- 
peatedly observed for large crustal earthquakes 
in recent years. Such scenarios are difficult to 
forecast for seismic hazard assessment. The 
rupture of the Ridgecrest mainshock termi- 
nated only a few kilometers from the Garlock 
fault, yet only aseismic creep was triggered 
at the closest section of the fault. Far to the 
southwest, a sizable swarm ensued, whereas 
triggered seismic activity on the entire eastern 
portion of the Garlock fault was negligible. At 
such close proximity to the mainshock rupture, 
the stress changes imparted by the mainshock 
are substantial. The last major earthquake 
occurred ~400 to 500 years ago (30). Future 
investigations that integrate the observed 
phenomena on the Garlock fault with geologic 
and geodetic observations will be important 
for understanding its contribution to seismic
hazard in the Eastern California shear zone.

Genetic regulatory variation in populations informs
transcriptome analysis in rare disease


Pejman Mohammadi, Stephane E. Castel, Beryl B. Cummings, Jonah Einson,
Christina Sousa, Paul Hoffman, Sandra Donkervoort, Zhuoxun Jiang, Payam Mohassel,
A. Reghan Foley, Heather E. Wheeler, Hae Kyung Im, Carsten G. Bonnemann,
Daniel G. MacArthur, Tuuli Lappalainen


Transcriptome data can facilitate the interpretation of the effects of rare genetic variants. Here, we 
introduce ANEVA (analysis of expression variation) to quantify genetic variation in gene dosage from allelic 
expression (AE) data in a population. Application of ANEVA to the Genotype-Tissue Expression (GTEx)
data showed that this variance estimate is robust and correlated with selective constraint in a gene. Using 
these variance estimates in a dosage outlier test (ANEVA-DOT) applied to AE data from 70 Mendelian 
muscular disease patients showed accuracy in detecting genes with pathogenic variants in previously 
resolved cases and led to one confirmed and several potential new diagnoses. Using our reference 
estimates from GTEx data, ANEVA-DOT can be incorporated in rare disease diagnostic pipelines to use
RNA-sequencing data more effectively.


Large reference databases of human exomes
and genomes have enabled the characterization
of genomic variation in human populations (1-3).
These data have been used to summarize genic
intolerance to damaging variants, where depletion
of gene-disrupting variants (e.g., stop-gain
variants) indicates deleterious fitness
consequences (1, 4, 5). Such analyses are essential
for prioritizing rare and de novo coding variants
that can underlie Mendelian disease and provide a
genetic diagnosis for 25 to 50% of patients (6, 7).
However, despite advances in DNA sequenc- 
ing, the search for rare disease-causing variants 
outside the coding sequence has been hindered 
by the difficulty of interpreting rare regulatory 
variants and identifying their target genes. 
Integration of genome- and transcriptome- 
sequencing data has provided improved diag- 
nosis by better detection of rare variants with 
functional effects (6, 8-10). However, the often 
laborious analysis is further complicated by 
the transcriptome being affected by the envi- 
ronment, disease state, and technical variation. 
This has made it challenging to quantify when 
an effect is genetic and beyond the normal 
population range. Thus, most analyses have 
been limited to only a small fraction of variants
that induce clear alterations in the transcriptome,
such as total loss of expression and splice defects.

One promising data type is allelic expression
(AE), which measures the relative expression of
the paternal and maternal haplotype of a gene in
an individual. Departure from equal AE, termed
allelic imbalance, is largely unaffected by
environmental and technical factors, with a
reported heritability of 85% (11), and therefore
has a higher sensitivity to capture cis-acting
genetic effects, including those induced by rare
variants (6, 12-15). However, a quantitative
framework for interpreting this data type to
identify rare pathogenic variants has been lacking.
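
As a rough illustration of the quantities involved, the sketch below computes a reference-allele ratio, an allelic imbalance, and a log2 allelic fold change from the two haplotype read counts at one heterozygous SNP. The function names, the definition of imbalance as the absolute deviation of the reference ratio from 0.5, and the optional pseudocount are assumptions made for this example; they are not taken from the ANEVA implementation.

```python
import math


def reference_ratio(ref_count, alt_count):
    """Fraction of reads carrying the reference allele at a heterozygous SNP."""
    total = ref_count + alt_count
    if total == 0:
        raise ValueError("no reads cover this SNP")
    return ref_count / total


def allelic_imbalance(ref_count, alt_count):
    """Absolute departure from equal allelic expression (0 = balanced, 0.5 = monoallelic).

    This convention is an assumption of the example, not a definition from the paper.
    """
    return abs(reference_ratio(ref_count, alt_count) - 0.5)


def log2_allelic_fold_change(ref_count, alt_count, pseudocount=0.0):
    """log2 ratio of alternative to reference haplotype counts.

    With pseudocount=0.0 both counts must be positive; a positive
    pseudocount (hypothetical choice) avoids errors for zero counts.
    """
    return math.log2((alt_count + pseudocount) / (ref_count + pseudocount))
```

For example, 30 reference reads and 10 alternative reads give a reference ratio of 0.75 and an imbalance of 0.25 under this convention.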

Here, we quantify the effects of genetic
regulatory variation in populations using a
mechanistic model of cis-regulatory variation.
Specifically, for each gene, we estimate V^G, the
expected variance in the dosage that is due to
interindividual genetic differences within a
population. Next, we use V^G as a reference to
identify genes affected by potentially pathogenic
regulatory variants in patients.


Results 
Generative model for population allelic 
expression data and the ANEVA method 


Cis-regulatory variant effect sizes can be
quantified with allelic fold change (aFC) (16). aFC
has an analytical link to gene dosage, which
would allow calculation of V^G if all regulatory
variants were known (supplementary materials,
Eqs. 1 to 7). In practice, we can use AE data to
estimate the overall distribution of regulatory
effects on a gene without having to identify
these variants explicitly. Across individuals,
AE data represent a series of comparisons between
the net expression effects of all variants
on two random haplotypes at a time. A major
complication for applications of AE data is
that, within a population, genes have diverse
patterns depending on the properties of regulatory
variants present and the single-nucleotide
polymorphism (SNP) used to measure the allelic
expression (aeSNP; Fig. 1, A to D) (14, 15).
We derived a generative model for population
AE data under a realistic scenario in which a
gene is regulated by several regulatory variants,
of which only some are identifiable. Under this
assumption, population AE data are described
by a constrained mixture of binomial-logit-normal
(BLN) probability distribution functions
(supplementary materials, Eqs. 8 to 19).
We fit this model to population AE data
(supplementary materials, Eqs. 20 to 28, and Fig. 1,
E to H) and use the maximum likelihood parameters
to estimate V^G indirectly (supplementary
materials, Eqs. 29 and 30). We refer to this
method as analysis of expression variation
(ANEVA). Simulations show that the inferred
V^G is accurate (R^2 = 0.92; fig. S1). Thus, ANEVA
allows biologically interpretable estimates of
genetic variation in gene expression within a
population to be derived from AE read count data.
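
To make the binomial-logit-normal building block concrete, the following sketch evaluates the probability of observing k reference reads out of n when the logit of the reference-allele probability is normally distributed. It covers a single BLN component only, not the constrained mixture or the fitting procedure described in the supplementary materials; the function name, the parameters `mu` and `sigma`, and the simple grid integration are assumptions of this example.

```python
import math


def bln_pmf(k, n, mu, sigma, grid=2001, z_max=8.0):
    """P(k reference reads out of n) when logit(p) ~ Normal(mu, sigma^2).

    The integral over the latent normal variable is approximated by a
    Riemann sum on an evenly spaced grid of standard-normal deviates.
    """
    step = 2.0 * z_max / (grid - 1)
    n_choose_k = math.comb(n, k)
    total = 0.0
    for i in range(grid):
        z = -z_max + i * step
        # Inverse logit maps the latent value to a reference-allele probability.
        p = 1.0 / (1.0 + math.exp(-(mu + sigma * z)))
        density = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
        total += n_choose_k * p**k * (1.0 - p) ** (n - k) * density * step
    return total
```

With `sigma = 0` the function reduces to an ordinary binomial probability, and increasing `sigma` spreads probability mass toward more imbalanced read counts, which is the qualitative behavior the model uses to represent regulatory variation.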


ANEVA estimates from AE data are consistent 
with eQTL data and heritability of 
gene expression 


We applied ANEVA to 10,361 RNA-sequencing
(RNA-seq) samples from 48 tissues and 620
individuals with whole-genome sequencing
(WGS) data from the Genotype-Tissue Expression
(GTEx) version 7 data (17, 18). Overall, we
estimated V^G at a median of 43,219 autosomal
aeSNPs per tissue. Gene-level V^G was derived
as a weighted harmonic mean of SNP-level
estimates for a median of 4962 genes per tissue
and a total of 14,084 genes (Fig. 2, A and
B, and table S1). First, we ensured that our
AE-derived estimates were consistent with
what is expected from expression quantitative
trait locus (eQTL) data [median correlation =
0.73; Fig. 2C, fig. S2, and table S2]. Next, we
benchmarked ANEVA estimates against gene
expression cis heritability (h^2). For GTEx whole
blood, we calculated the ratio of AE- and
eQTL-derived V^G to the total variance of gene
expression. These ANEVA-based h^2 estimates were
consistent and comparable to those from studies
using standard methods and with larger datasets,
confirming that V^G measures the genetic
variation in gene expression (Fig. 2D and fig.
S3). Because AE-based ANEVA V^G estimates are
more applicable to AE-based outlier detection,
we used these estimates for all subsequent
analyses (fig. S14) (19).
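
The aggregation of SNP-level estimates into a gene-level value can be sketched as a weighted harmonic mean. The weighting scheme used for ANEVA is not given in this text, so the weights below are placeholders supplied by the caller, and the function name is hypothetical.

```python
def weighted_harmonic_mean(values, weights):
    """Weighted harmonic mean: sum(w) / sum(w / v).

    `values` are positive SNP-level variance estimates and `weights` are
    non-negative caller-supplied weights (an assumption of this sketch).
    """
    if len(values) != len(weights) or len(values) == 0:
        raise ValueError("values and weights must be non-empty and equal in length")
    if any(v <= 0 for v in values):
        raise ValueError("harmonic mean requires strictly positive values")
    if any(w < 0 for w in weights) or sum(weights) == 0:
        raise ValueError("weights must be non-negative and not all zero")
    return sum(weights) / sum(w / v for v, w in zip(values, weights))
```

A harmonic mean is pulled toward the smaller inputs, so a single large SNP-level estimate has limited influence on the gene-level value.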


Genetically driven variation in gene expression 
across tissues, populations, and gene sets 


Next, we analyzed how V^G varies between tissues
and populations. The estimates were well
correlated between tissues (median correlation =
0.57; Fig. 3A). For a given gene, V^G tends to be
smaller in tissues where the gene is more highly
expressed (Wilcoxon signed-rank test p < 10°;
Fig. 3B). Because this was not an artifact of
differences in read depth (fig. S4), it suggests
that there is an increased dosage sensitivity and
a higher selective constraint in tissues where the
gene has a more pronounced functional role (see
fig. S5 for an example). To analyze population
differences in V^G, we used ANEVA on AE data from
three European and one African subpopulation from
the GEUVADIS data (20). We found a high correlation
between estimates from all subpopulations
(correlation range: [0.75, 0.83]; fig. S6). This
suggests that the total amount of genetic dosage
variation is not highly variable between
populations, and approaches that aggregate genetic
effects at the gene level may have better
applicability across populations than analyses of
individual variants.

To characterize differences in the amount
of genetic regulatory variation between genes,
we correlated V^G to statistics of gene regulation
and constraint. For each gene, we calculated a
weighted harmonic mean of V^G across tissues
(table S1). Gene enhancer size had a minimal
correlation to V^G (Fig. 3C) (21), suggesting that
the size of the mutational target, a proxy for the
background mutation rate, plays a minor role.
Genes with high purifying selection for coding
gene-disrupting variants or noncoding variants in
the promoter or untranslated regions (UTRs) were
depleted of genetic regulatory variation (Fig. 3C),
as previously observed by eQTL analysis (7). Rare
disease genes had lower V^G, whereas
loss-of-function tolerant genes had higher V^G
(Fig. 3D), showing that dosage sensitivity is
captured by both exome and regulatory variation
analysis. Genes identified by genome-wide
association studies (GWAS) showed little deviation
from the background, but schizophrenia genes had
the lowest V^G and blood metabolite genes the
highest, suggesting a link to genetic architecture
of these traits. The amount of genetic regulation
variation measured as V^G can complement previous
coding and regulatory variation analyses of
selective constraint on genes and traits.

Fig. 1. Cis-regulatory variation, allelic expression, and ANEVA.
(A to D) Examples of allelic expression across individuals (dots) for
four genes with a single aeSNP each. In (A), similar haplotype
expression levels for the gene indicate little cis-regulatory
variation. In (B) to (D), there is [...] driven by different haplotype
combinations of a common, strong regulatory variant and the aeSNP, with
strong linkage disequilibrium in (D). These examples illustrate the
challenge of consistently modeling the underlying regulatory variants.
Ref., reference; Alt., alternative. (E) Schematic representation of
ANEVA, which uses a generative model of population AE data and a
mechanistic model of cis- [...] diagram (H) (supplementary materials,
Eqs. 20 to 22). AE data are modeled with one distinctly strong
regulatory biallelic variant. If present, this variant is specified by
its effect size, Sy, and a measure of linkage disequilibrium (LD) with
the aeSNP. Residual cis-regulatory variation is modeled as an
infinite-allelic regulatory variant summarized by variance term σ².
Allelic expressions e_R and e_A are measured at a heterozygous aeSNP
with reference (R) and alternative (A) alleles, and s*g.q is the aeSNP
reference allele alignment bias. Haplotypes h_1 and h_2, basal
expression level eg, and N cis-regulatory variant sites v_1...v_N are
components of our complete formal model of cis-regulatory variation.

Fig. 2. Estimates of genetic regulatory variation in GTEx. (A) Number
of genes with V^G estimates across one to 49 GTEx tissues. Min.,
minimum. (B and C) Distribution of SD^G, √(V^G), for 7556 genes in GTEx
subcutaneous adipose (B) and its comparison with eQTL data [(C);
correlation (corr.) = 0.71]. The red line is Deming regression fit
(fig. S2). SD^G is capped at 0.5 for visualization. (D) Benchmarking of
ANEVA by gene expression heritability (h^2) estimates. GTEx h^2 was
calculated by the linear mixed-model based BSLMM, PrediXcan R^2, and
ANEVA (19). These were compared with two larger cohorts: BSLMM h^2 from
the Depression Genes and Networks (DGN) cohort [n = 922 (32)] and local
identity-by-descent (IBD)-based h^2 from the Iceland family blood (IFB)
cohort [n = 722 (33)].


Genetically driven variation in gene expression 
and dosage outlier testing from AE data 


In addition to these biological insights, V^G has
a direct practical application in identifying
population outliers that may be pathogenic.
To this end, we developed the ANEVA dosage
outlier test (ANEVA-DOT) to identify genes
likely affected by a heterozygous genetic variant
with an unusually strong effect on gene
dosage. Using V^G for each gene, ANEVA-DOT
tests against the null hypothesis that the
observed allelic imbalance in an individual is
consistent with dosage variation in the general
population (Fig. 4A) while accounting
for a number of additional technical and
biological sources of variation (supplementary
materials, Eqs. 31 to 42). We used extensive
simulations to ensure that the test is well
calibrated (fig. S7). ANEVA-DOT is implemented
in R and runs in a few seconds per sample (22).

Fig. 3. Biological sources of regulatory variation between genes.
(A) Correlation (corr.) of genetic regulatory variation across GTEx
tissues (see table S1 for tissue names). (B) Rank correlation between
median expression in a tissue and V^G for 9158 genes with V^G estimates
in at least five tissues. The distribution is shifted (median rank
correlation -0.20). Significant genes are shown in red (5% FDR).
(C) Rank correlation of V^G with enhancer size, coding constraint
(RVIS, pLI), noncoding constraint (ncRVIS), and noncoding conservation
(ncGERP) in UTRs and promoters. (D) V^G for different gene sets (DD,
developmental disorder; CHD, congenital heart disease; MDM, congenital
muscular dystrophies and myopathies; table S3), with nominal p-values
from rank-sum test compared with the background of all genes (p < 0.01
highlighted) and the number of genes in parentheses. Boxes span the
middle 50% values and the whiskers span ±1.5 interquartile range (IQR)
from first and the third quartile.

We first tested ANEVA-DOT in the general 
population of 466 skeletal muscle samples 
from GTEx. Each sample had a median of 
3390 genes tested and 10 genes identified as 
outliers at a 5% false discovery rate (FDR) 
(hereafter referred to as ANEVA-DOT genes; 
90% range: [3, 22]). An average of 56% of the 
genes previously implicated in neuromuscular 
disorders (6, 23) and up to 46% of the highly 
expressed genes were testable per individual 
(fig. S8). As a quality filter, 113 out of 5848 
tested genes that appeared as outliers in >1% 
of the individuals were excluded from further 
analysis (fig. S8, D to F, and table S4). After this 
step, a median of 4.5 ANEVA-DOT genes were 
retained per individual (90% range: [1, 14]; Fig. 
4B). ANEVA-DOT genes were highly enriched 
for rare heterozygous variants in a 10-kb win- 
dow upstream of the transcriptional start site 
and in the gene body (Fig. 4B). This enrichment
was particularly pronounced for rare putative
gene-disrupting variants that are expected to
have a strong effect on gene expression levels
by nonsense-mediated decay (Fig. 4, B and C).
This confirms that ANEVA-DOT
captures rare genetic effects on gene dosage. 
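
The text reports outliers at a 5% FDR without naming the correction procedure. As an illustration only, the sketch below applies the Benjamini-Hochberg step-up procedure to a list of per-gene p-values; the choice of procedure and the function name are assumptions of this example, not a description of the ANEVA-DOT code.

```python
def benjamini_hochberg(p_values, fdr=0.05):
    """Return a list of booleans marking which tests are rejected at the given FDR.

    Step-up rule: find the largest rank k (1-based, ascending p-values)
    with p_(k) <= fdr * k / m, then reject the k smallest p-values.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= fdr * rank / m:
            cutoff_rank = rank
    rejected = [False] * m
    for idx in order[:cutoff_rank]:
        rejected[idx] = True
    return rejected
```

Applied to the per-gene p-values of one sample, the genes flagged `True` would correspond to that sample's outlier list under this assumed procedure.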

Next, we evaluated how sensitive ANEVA-DOT
is to differences in the reference population
where V^G is calculated. First, using the
GEUVADIS data (20), we looked for ANEVA-DOT
genes in 86 European (GBR) individuals
using V^G estimates derived from two European
populations (FIN and TSI) and one African
population (YRI). The three reference populations
performed similarly, with an average of
74% (range 69 to 78%) of ANEVA-DOT genes
identified using one confirmed by another (fig.
S9), suggesting that the lack of full concordance
is likely driven by noise and threshold
effects. However, larger sample sizes will be 
needed for a comprehensive evaluation of 
the population effects. Next, we investigated 
whether ANEVA-DOT genes in GTEx skeletal 
muscle could be identified by analyzing other 
accessible tissues of these individuals. The 
detection rate varied from 23.3% in fibroblast 
to 12.3% in whole blood, which indicates that 
ANEVA-DOT can capture some outlier effects 
also from proxy tissues (fig. S10). 


ANEVA-DOT accurately identifies disease genes 
in AE data from rare disease patients 
To test ANEVA-DOT's performance in the
diagnosis of rare disease patients, we applied it
to AE data from 70 rare Mendelian muscle
dystrophy and myopathy (MDM) patients
using the V^G reference from GTEx skeletal
muscle (figs. S11 to S17, and table S5). Of the
65 patients with high-quality data, 32 had a
previous diagnosis, of which 21 were expected
to lead to allelic imbalance (6). These cases
were used as positive controls to benchmark 
ANEVA-DOT against previous tests of allelic 
imbalance: binomial and beta-binomial tests, 
binomial test with an allelic imbalance threshold,
and a naive population-aware test of excess
allelic imbalance against GTEx data using
the z-test (Fig. 4, D to H, and fig. S12).
ANEVA-DOT identified a median of 11 outlier genes
per individual (out of a median of 2190 tested),
substantially fewer than other tests (Fig. 4H).
This small number of outliers always included
the previously diagnosed gene when there was
a detectable allelic imbalance present (76%;
figs. S11 and S12), typically (69%) among the
top five most significant genes (table S5).
ANEVA-DOT's high recall and precision
outperformed all of the other tests by a
substantial margin (Fig. 4I and figs. S12 to S14) (19).
In the 33 patients without a genetic diagnosis
from previous whole-exome sequencing
(WES) and/or WGS or RNA-seq analysis (6),
we found a median of nine ANEVA-DOT genes
per sample (in total 349 genes), which included
at least one neuromuscular disease gene (6, 23)
in 12 patients (in total 17 genes; figs. S15 and
S16). One of these potential new diagnoses
from ANEVA-DOT was confirmed: Patient N10,
with a limb-girdle muscular dystrophy-like
phenotype, had 13 ANEVA-DOT genes, with the
one known Mendelian muscle disease gene,
DES, being the most statistically significant.
Further RNA-seq and reverse transcription
polymerase chain reaction analysis identified
a pseudo-exon insertion caused by a variant
creating an intronic splice site. This had been
missed by the prior gene panel, WES, WGS,
and RNA-seq analysis because of challenging
in silico interpretation of intronic variants and
the relatively low number of RNA-seq reads.
The variant is in trans with a pathogenic
missense variant that had not been identified as
a diagnosis because of the lack of a second
variant (fig. S18). Additionally, ANEVA-DOT
identified strong candidates in six cases and
possible candidates in 11 others (19). By design,
ANEVA-DOT does not rely on identifying
which variant underlies the dosage-outlier
effect, but genetic analysis can be applied
after prioritizing genes by ANEVA-DOT. This is
currently mostly limited to gene- or
splice-disrupting variants because of their easier
annotation compared with rare regulatory variant
candidates that may also exist. Overall, we
expect up to 10.5 of the 17 known MDM and
18.8 of all 349 identified ANEVA-DOT genes
in the 33 undiagnosed patients to be true
disrupted causative genes (19).

Fig. 4. Regulatory outliers detected with the ANEVA-DOT.
(A) Illustration of the ANEVA-DOT method. For each gene, the null
distribution of allelic imbalance is estimated using the V^G and the
model of cis-regulatory genetic effect. Allelic counts in a test
individual are compared with this null, accounting for sampling noise,
sequencing noise, reference bias, and the variant haplotype.
(B and C) Enrichment of all rare variants in ANEVA-DOT genes as a
function of allele frequency (B) and for putative gene-disrupting
variants [minor allele frequency < 1% (C)]. (D to H) Example of AE data
for all genes from one previously diagnosed muscle dystrophy patient
(N13). The disease gene is shown in blue. Outlier genes identified by
different tests (5% FDR) are marked in red: binomial [n = 387 (D)],
binomial with a 15% allelic imbalance threshold [Bin-Thr, n = 246 (E)],
beta-binomial [Beta-Bin, n = 83 (F)], excess allelic imbalance against
GTEx data by z-test [AI z-test, n = 94 (G)], and ANEVA-DOT
[n = 15 (H)]. Genes marked in gray are excluded from each test.
(I) Fraction of true causal genes identified in previously diagnosed
patients (recall) and its 95% bootstrap confidence intervals versus the
number of outliers reported. Empirical recall (left) is calculated
using all cases in which imbalanced AE would be expected (n = 21),
whereas idealized recall (right) excludes five cases in which detecting
the gene from AE data is impossible (e.g., when the causal gene is not
expressed; fig. S12).


Discussion 


In this study, we introduce a method, ANEVA, 
and its extension, ANEVA-DOT, to quantify 
genetic variation in gene dosage in the gen- 
eral population, and to identify genes where a 
patient appears to carry a heterozygous vari- 
ant with an unusually strong effect on gene ex- 
pression. This enables individual transcriptome 
comparison to previously generated reference 
data without the caveats of technical and re- 
verse causation noise in total gene expression 
analysis. 

The ANEVA framework uses biologically 
interpretable units of gene dosage, allowing 
interpretation of regulatory and coding gene- 
disrupting variants on the same scale. Fur- 
thermore, the statistical methods introduced 
here for modeling allelic expression data are 
applicable to other uses of this data type. 

ANEVA-DOT is a fast and powerful approach 
for finding genes with likely disease effects, 
with the small numbers of outliers making 
further manual curation feasible in a clinical 
setting without compromising sensitivity. The 
use of V^G estimates from GTEx as a shared
reference for ANEVA-DOT analysis of patients 
is analogous to the use of coding constraint 
metrics for prioritization of pathogenic coding 
variants. ANEVA-DOT outlier genes can be 
further prioritized by candidate gene lists and 
by tools that are currently used in exome
sequencing follow-up (1, 2, 5, 24, 25). Because
ANEVA-DOT captures transcriptome outcomes
of genetic effects without having to
identify rare regulatory variants themselves,
this method is particularly advantageous for
rare genetic effects from poorly defined
regulatory elements, but it will also detect, for
example, variants triggering transcript de-
cay. However, identifying the specific variants 
underlying ANEVA-DOT outliers is still chal- 
lenging despite existing variant prioritization 
approaches, especially for noncoding regions 
(26-28). 

Despite these advantages, our methods have 
several limitations. The main caveat is that AE 
data are sparse, and V^G estimates may be lacking
or noisy for genes with few common cod-
ing variants owing to small size, high coding 
constraint, or low expression levels. These issues 
will, however, improve with increasingly large 
RNA-seq datasets. ANEVA-DOT is only appli- 
cable to about half of expressed genes per 
individual that have an aeSNP. Finally, allelic 
imbalance is not informative of recessive 
effects without family analysis. Thus, like other 
genetic diagnosis tools, ANEVA-DOT should 
be used in conjunction with other methods to 
capture different types of rare variants un- 
derlying disease. We envision that in clinical 
genetics, when practically feasible, transcrip- 
tome data will become a powerful additional 
layer of data for interpreting the genome and 
its disease-contributing variants. 

EXOPLANETS

Oxygen fugacities of extrasolar rocks: Evidence for
an Earth-like geochemistry of exoplanets

Alexandra E. Doyle, Edward D. Young, Beth Klein, Ben Zuckerman, Hilke E. Schlichting


Oxygen fugacity is a measure of rock oxidation that influences planetary structure and evolution.
Most rocky bodies in the Solar System formed at oxygen fugacities approximately five orders of
magnitude higher than a hydrogen-rich gas of solar composition. It is unclear whether this oxidation of 
rocks in the Solar System is typical among other planetary systems. We exploit the elemental 
abundances observed in six white dwarfs polluted by the accretion of rocky bodies to determine the fraction of 
oxidized iron in those extrasolar rocky bodies and therefore their oxygen fugacities. The results are 
consistent with the oxygen fugacities of Earth, Mars, and typical asteroids in the Solar System, suggesting 
that at least some rocky exoplanets are geophysically and geochemically similar to Earth. 


Estimating the composition of extrasolar
planets from host-star abundances or from
planet mass-radius relationships is difficult
and unreliable (1, 2). The elemental abundances
in some white dwarfs (WDs) provide
an alternative, more direct approach for
determining the composition of extrasolar rocks.
WDs are the remnant cores left behind when
a star ejects its hydrogen-rich outer layers after
the red giant phase. These remnant cores are
~0.5 M☉ (solar masses) and about the same
radius as Earth, are no longer powered by fusion,
and slowly cool over time. Because of their high
densities, and thus strong gravitational fields,
elements heavier than helium rapidly sink below
their surfaces, becoming unobservable.
Nonetheless, spectroscopic studies show that the
atmospheres of up to half of WDs with effective
temperatures <25,000 K are “polluted” by
elements heavier than He (3-5). The source of
these heavy elements is exogenous, coming from
accretion of debris from rocky bodies that
previously orbited the WDs (6-9). We exploit this
pollution to measure the elemental constituents
of extrasolar rocky bodies. We collated
observations from the literature of polluting elements
in six WDs: SDSS J104341.53+085558.2 (10),
SDSS J122859.92+104033.0 (9), SBSS 1536+520
(11), GD 40 (8, 12), SDSS J073842.56+183509.6
(13), and LBQS 1145+0145 (14) (hereafter, SDSS
J1043+0855, WD 1226+110, WD 1536+520, GD 40,
SDSS J0738+1835, and WD 1145+017,
respectively). Their coordinates are listed in table S1.
The bulk compositions of the bodies polluting
these WDs resemble those of rocky bodies in
the Solar System (15, 16) (Fig. 1).

Fig. 1. Bulk compositions by mass for six white dwarfs compared with Solar System bodies. Bulk
compositions of the six rock-forming elements Al, Ca, Si, Mg, Fe, and O are indicated by the colored bars.
The six white dwarfs are shown in the right-most columns. Shown for comparison are Solar System
objects: the Sun, Comet Halley (1P/Halley), Earth, the Moon, Vesta, Mars, Mercury, three types of meteorites
(enstatite chondrite, ordinary L chondrite, and carbonaceous CI chondrite), and three terrestrial igneous
rock types [mid ocean ridge basalt (MORB), lherzolite (representing Earth’s mantle), and bulk silicate
Earth (BSE)] (16). The relatively high abundances of Fe in bulk Mercury and bulk Earth are due to their
metal cores. The compositions of the white dwarfs are similar to the Solar System rocks. The large amount of
O in WD 1145+017 is highly uncertain. Values are listed in data S1.

We use the relative abundances of rock-forming
elements in polluted WDs to determine
the effective partial pressure of oxygen, i.e., the
oxygen fugacity (fO2) of the accreted rocks.
Oxygen fugacity is a measure of the degree of
oxidation in the rocks. It corresponds to the
effective partial pressure of gaseous oxygen that
would be in thermodynamic equilibrium with
the material of interest. In combination with
other factors, the intrinsic oxygen fugacity of
a planet will determine the relative size of its
metallic core, the geochemistry of its mantle
and crust, the composition of its atmosphere,
and the forces responsible for mountain building
(17, 18). Oxygen fugacity is also thought to
be among the parameters that determine the
habitability of a planet (19). In practice, fO2 is
usually expressed as the nonideal partial pressure
of oxygen relative to a convenient reference
value.

Oxygen fugacities of rocky planets are often
reported relative to the reference Iron-Wüstite
(IW) equilibrium reaction Fe (Iron) + 1/2 O2 =
FeO (Wüstite), such that ΔIW = log(fO2) −
log(fO2)_IW (16). When expressed this way,
differences in oxygen fugacity are nearly
independent of temperature and pressure (16). The
initial oxidation state of a rocky body with at
least some Fe metal at the time of its formation
is recorded by the concentration of oxidized
iron (hereafter denoted as FeO, although
it may include other oxides of iron) in the rock
and the concentration of Fe in the metal

$\Delta\mathrm{IW} = 2\log\left(\frac{x_{\mathrm{FeO}}^{\mathrm{rock}}}{x_{\mathrm{Fe}}^{\mathrm{metal}}}\right) + 2\log\left(\frac{\gamma_{\mathrm{FeO}}^{\mathrm{rock}}}{\gamma_{\mathrm{Fe}}^{\mathrm{metal}}}\right)$ (1)

where $x_i^k$ are mole fractions of the species i
in phase k, $\gamma_i^k$ are activity coefficients for the
species, and thermodynamic activities are $a_i^k =
x_i^k \gamma_i^k$. To facilitate comparison, we set the
uncertain activity coefficients to unity, so the second
term on the right-hand side of Eq. 1 vanishes.
Equation 1 expresses the fO2 at the time the
planet or planetesimal formed (20, 21); we refer
to this as the intrinsic oxygen fugacity of the
body. The partitioning of iron between rock and
metal during formation leaves a record of the
intrinsic oxygen fugacity in the form of the
mole fraction of FeO in the rocks, $x_{\mathrm{FeO}}^{\mathrm{rock}}$. This
signature persists even after the rock and metal
are separated by the process of differentiation
(partitioning between core and mantle). This
is because changes in the valence state of iron
during subsequent reactions proceed without
appreciably altering the total amount of iron
bonded to oxygen in the rocks. For example,
the reaction

$2\,\mathrm{Fe^{2+}Fe^{3+}_2O_4^{spinel}} + 6\,\mathrm{Fe^{2+}SiO_3^{pyroxene}} = 6\,\mathrm{Fe^{2+}_2SiO_4^{olivine}} + \mathrm{O_2}$ (2)

determines the Fe3+/Fe2+ ratio, and thus the
fO2, in a rock containing the minerals spinel,
pyroxene, and olivine, without substantially
altering the total Fe bonded to oxygen (from
2.17 oxygens per Fe to 2.00 oxygens per Fe).
Reactions like these that follow the formation
of a rocky body lead to local variations in fO2
within the body but do not generally alter the
intrinsic oxygen fugacity recorded by application
of Eq. 1 (Fig. 2). The intrinsic oxygen fugacity
of Earth is constrained by $x_{\mathrm{FeO}}^{\mathrm{mantle}}$ = 0.06
[8 wt % (weight percent) FeO] in its mantle and
the composition of its Fe-rich core. This
leads to a terrestrial ΔIW value of about −1 to
−2, with the range due to the uncertain values
for the activity coefficient ratio [commonly
used values of $\gamma_{\mathrm{FeO}}^{\mathrm{rock}}/\gamma_{\mathrm{Fe}}^{\mathrm{metal}}$ range from
~1 to 4 (22)].
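
As a worked check of Eq. 1, the sketch below evaluates ΔIW for Earth-like inputs. The mantle value of 0.06 for the FeO mole fraction comes from the text; the mole fraction of Fe in the metal (0.85) is an assumed illustrative value that does not appear in the text, base-10 logarithms are assumed, and the function and parameter names are hypothetical.

```python
import math


def delta_iw(x_feo_rock, x_fe_metal, gamma_ratio=1.0):
    """Oxygen fugacity relative to the Iron-Wustite buffer, following Eq. 1.

    x_feo_rock  : mole fraction of FeO in the rock
    x_fe_metal  : mole fraction of Fe in the metal
    gamma_ratio : activity coefficient ratio gamma_FeO(rock) / gamma_Fe(metal);
                  1.0 makes the second term of Eq. 1 vanish
    """
    if x_feo_rock <= 0 or x_fe_metal <= 0 or gamma_ratio <= 0:
        raise ValueError("mole fractions and activity ratio must be positive")
    return 2.0 * math.log10(x_feo_rock / x_fe_metal) + 2.0 * math.log10(gamma_ratio)


# Earth-like example: x_FeO = 0.06 (from the text), x_Fe = 0.85 (assumed here).
earth_ideal = delta_iw(0.06, 0.85)                      # activity ratio of 1
earth_nonideal = delta_iw(0.06, 0.85, gamma_ratio=4.0)  # activity ratio of 4
```

Under these assumptions the two values come out near −2.3 and −1.1, which brackets the range of about −1 to −2 quoted for Earth and shows how much of that range is attributable to the activity coefficient ratio alone.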

The material accreted by the six polluted
WDs in this study is rock devoid of metal,
as demonstrated by the lack of excess Fe rela- 
tive to oxygen (Fig. 1). Separation of metal and 
rock during accretion onto WDs is suggested by 
the ranges in element ratios in polluted WDs 
(23) and from observations of a metal-density 
planetesimal core orbiting a WD (24). Our $f_{\mathrm{O}_2}$
measurements are representative of the fugac- 
ity values at the time of core formation, even 
though they are derived from crustal or man- 
tle rocks. The maximum intrinsic oxygen fugac- 
ity calculated from Eq. 1 approaches 0 as the 
mole fraction of Fe in the silicate increases. 
Values for ΔIW greater than ~−0.9 for elemental
concentrations similar to Solar System rocks
imply that all of the iron has been oxidized, and
that the intrinsic ΔIW values from Eq. 1 are therefore
minimum estimates for the oxidation state
at the time the rocks formed. 


Doyle et al., Science 366, 356-359 (2019) 


The solar protoplanetary disk must, on aver- 
age, have had the same composition as the Sun 
(see supplementary text in the supplemen- 
tary materials). The oxygen fugacity of a gas 
with solar composition is determined by its 
H₂O/H₂ ratio, after correcting for the oxygen
bound to carbon in CO and other less abundant
oxides, according to the reaction H₂ + ½ O₂ =
H₂O (6). Studies of meteorites reveal that, like
Earth, most rocky bodies in the Solar System
formed with ΔIW approximately five orders
of magnitude higher than that of a solar gas 
(25, 26) (Fig. 3). The presence of large amounts 
of iron bonded to oxygen in silicates in chon- 
drite meteorites indicates there was a relatively 
high oxygen fugacity during the earliest stages 
of rock formation in the Solar System (27). The 
enhancement in oxygen fugacity during rocky 
body formation may be attributable to the sub- 
limation of water-rich and/or rock-rich dust at 
high dust/gas ratios (28). In this context, we 
examine whether the processes that led to oxi- 
dation of rocks in the Solar System are typical 
of other planetary systems, and therefore whether 
the geophysical and geochemical character- 
istics of Earth are likely to be common among 
rocky exoplanets. 

When the six major rock-forming elements
are measured in a polluted WD, the abundance
of FeO may be used to determine the oxidation
state of the accreted exoplanetary rocky bodies 
(2, 8, 29). Polluted WDs with observed abun- 
dances of O, Mg, Si, Fe, Al, and Ca can be used to 
calculate oxygen fugacities from Eq. 1 by recog- 
nizing that any Fe not bonded to oxygen must 
have existed as metal in the accreted bodies. 
Data for polluted WDs are preferable to elemen- 
tal abundances in other stars because of the 
known rocky provenance of the accreted ele- 
ments in WDs, especially in the case of oxygen. 

Our basic methodology is as follows: The 
oxide components MgO, SiO₂, FeO, Al₂O₃, and
CaO describe the compositions of the major 
minerals that make up the accreting rocks. By 
assigning oxygen first to Mg, then Si, Al, Ca, 
and finally Fe, we calculate the relative amount 
of oxidized Fe, as FeO, and assign any remain- 
ing Fe to metal representing the core of the 
body (2, 8, 30). We propagate measurement 
uncertainties for the polluted WDs using a 
Monte Carlo bootstrap approach (16). 
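
The oxygen bookkeeping described above can be sketched as follows. This is a minimal illustration, not the published procedure: the function names, the example abundances, the oxide-basis definition of the FeO mole fraction, the treatment of an oxygen deficit as zero FeO, and the log-normal perturbation used for the Monte Carlo step are all assumptions made here.

```python
import random

# Oxygen atoms consumed per cation, in the assignment order given in the text
# (MgO, SiO2, Al2O3 -> 1.5 O per Al, CaO). Fe is handled last.
OXYGEN_PER_CATION = [("Mg", 1.0), ("Si", 2.0), ("Al", 1.5), ("Ca", 1.0)]


def partition_iron(abund):
    """Split Fe into oxidized (FeO) and metallic parts.

    `abund` maps element symbols to relative number abundances.
    Returns (fe_as_feo, fe_as_metal).
    """
    oxygen_left = abund["O"]
    for element, o_per_cation in OXYGEN_PER_CATION:
        oxygen_left -= o_per_cation * abund[element]
    # Leftover oxygen bonds to Fe as FeO; it can oxidize at most all of the Fe.
    # ASSUMPTION: an oxygen deficit is treated as "no FeO".
    fe_as_feo = min(max(oxygen_left, 0.0), abund["Fe"])
    return fe_as_feo, abund["Fe"] - fe_as_feo


def feo_mole_fraction(abund):
    """Mole fraction of FeO among the oxide components (ASSUMED oxide basis)."""
    fe_as_feo, _ = partition_iron(abund)
    oxide_moles = (abund["Mg"] + abund["Si"] + abund["Al"] / 2.0
                   + abund["Ca"] + fe_as_feo)
    return fe_as_feo / oxide_moles


def monte_carlo_feo(abund, sigma_dex, n_draws=1000, seed=0):
    """Propagate a common log10 uncertainty (in dex) through the calculation."""
    rng = random.Random(seed)
    draws = []
    for _ in range(n_draws):
        perturbed = {el: value * 10.0 ** rng.gauss(0.0, sigma_dex)
                     for el, value in abund.items()}
        draws.append(feo_mole_fraction(perturbed))
    return draws


# Hypothetical abundances by number (NOT measurements from this study).
example = {"O": 3.3, "Mg": 1.0, "Si": 1.0, "Al": 0.1, "Ca": 0.05, "Fe": 0.5}
print(partition_iron(example), feo_mole_fraction(example))
```

For the hypothetical composition, Mg, Si, Al, and Ca consume 3.2 of the 3.3 oxygen units, so about 0.1 of the Fe is assigned to FeO and the remaining 0.4 to metal. The resulting FeO mole fraction would then feed Eq. 1.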

We validated our method using Solar System
bodies by converting the composition of these
bodies into hypothetical polluted WDs, as if
rocks from the bodies (e.g., Earth, Mars, Mercury)
had accreted onto a WD. We used typical
WD measurement uncertainties for these calculations
and recovered the known intrinsic
oxygen fugacities for Earth, Mars, Mercury,
Vesta, and various chondritic bodies (16). The
Solar System bodies span a range in ΔIW of
~6 dex, in agreement with previous studies
showing that Mercury and enstatite meteorites
have $f_{\mathrm{O}_2}$ orders of magnitude lower than
those for Earth, Mars, and other chondrite
group meteorites (31, 32).

Fig. 2. Oxygen fugacities relative to IW at core formation versus today. Terrestrial and Martian
rocks are characterized by ΔIW at the time of core formation, as calculated from the concentration
of FeO, and ΔIW as measured today using various other measures of oxygen fugacity (16). Bulk Earth
and bulk Mars values are also shown, demonstrating their similarity in intrinsic ΔIW at the time of
core formation, despite the ranges in ΔIW as measured today. Error bars are 1σ (16). Where oxygen
fugacities were previously reported relative to the quartz-fayalite-magnetite buffer (QFM), we
converted them using ΔIW = ΔQFM − 4. Andesite, basalt, and gabbro represent crustal rocks, whereas
lherzolite and harzburgite, specific types of peridotites, are mantle rocks.
[Plot axes: ΔIW (at core formation) versus ΔIW (today). Legend: terrestrial basalt, terrestrial
andesite, terrestrial lherzolite, terrestrial harzburgite, terrestrial gabbro, Martian basalt,
bulk Earth, bulk Mars.]

Fig. 3. Calculated oxygen fugacities relative to IW for rocky extrasolar bodies. Numbered circles
show the values for rocky debris that polluted the white dwarfs: (1) SDSS J1043+0855; (2) WD 1536+520;
(3) GD 40; (4) SDSS J0738+1835; (5) WD 1226+110; and (6) WD 1145+017. Values are listed in table S1.
Error bars (1σ) are from propagation of measurement uncertainties (16). Only an upper limit could be
obtained for SDSS J1043+0855 owing to the measurement uncertainties relative to the Fe concentration
for that star (16). The ranges of relative oxygen fugacities for a gas of solar composition (yellow) and
for most Solar System rocky bodies (blue) are shown for comparison. Rocks from Solar System planets
are also shown and are represented by their planet symbols: Earth (⊕), Mars (♂), and Mercury (☿).
Triangles show values for meteorites, representing bodies in the asteroid belt, including Vesta. The inset
shows an example ΔIW probability distribution for one of the WDs, GD 40 (16); equivalents for the
other WDs are shown in fig. S3.

The six WDs in this study were chosen be- 
cause quantitative measurements of all six 
major rock-forming elements are available for 
each. These WDs also exhibit infrared excesses, 
indicative of surrounding debris disks (6). The 
ΔIW values we obtain for the rocks accreted by
the polluted WDs are all similar to those of 
Earth, Mars, Vesta, and the asteroids repre- 
sented by carbonaceous (C) and ordinary (O) 
chondrites in the Solar System (Fig. 3). 

In cases where there is both more oxygen than 
required to oxidize all other major elements and 
a commensurate amount of hydrogen, water in 
the accreting body is implied and the partition- 
ing of oxygen between ice and Fe can be am- 
biguous (33, 34). In addition, unaccounted-for 
Si in metal cores may have liberated oxygen to 
oxidize Fe in the accreted bodies. We find that
these effects are small for the WDs in this
study and do not affect the elemental abundances
we used to derive oxygen fugacities
(16) (fig. S5).

The high oxygen fugacities of these extra- 
solar rocks, relative to a solar gas, suggest that 
whatever process oxidized rock-forming mate- 
rials in the Solar System also operated in these 
other planetary systems. The large amount of 
oxidized iron in chondrite meteorites shows 
that oxidation relative to a solar gas occurred 
early in the Solar System, evidently before, or 
during the earliest stages of, planetesimal formation.
Raising ΔIW by 5 log units, from solar
to rock-like values, requires the gas to acquire
an H₂O/H₂ ratio ~400 times that of a solar gas
(28). This enrichment factor is greater than 
can be explained by simply transporting water 
in the form of ice particles from the outer to 
the inner Solar System (28). If dust/gas ratios 
control the oxidation states during rock forma- 
tion, we conclude that the Solar System and the 
planetary systems around these six polluted 
WDs had similar ratios. This implies that high 
dust/gas ratios are intrinsic to rock formation 
in protoplanetary disks. A similar compositional
link between planet formation in the
Solar System and that around other stars is 
indicated by the depletion of carbon in both 
solar and extrasolar rocks (35, 36). 
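
The link between the ΔIW shift and the required H₂O/H₂ enrichment can be illustrated with a short calculation. The sketch assumes gas-phase equilibrium for H₂ + ½ O₂ = H₂O at fixed temperature, so that the oxygen fugacity scales as the square of the H₂O/H₂ ratio; the function names are hypothetical and the treatment ignores the carbon correction mentioned earlier.

```python
import math


def enrichment_for_shift(delta_log_fo2):
    """H2O/H2 enrichment factor needed to raise log10(fO2) by delta_log_fo2.

    ASSUMES fO2 is proportional to (H2O/H2)**2 at fixed temperature.
    """
    return 10.0 ** (delta_log_fo2 / 2.0)


def shift_for_enrichment(enrichment):
    """Change in log10(fO2) produced by a given H2O/H2 enrichment factor."""
    return 2.0 * math.log10(enrichment)


print(f"{enrichment_for_shift(5.0):.0f}x for a 5-dex shift")
print(f"{shift_for_enrichment(400.0):.1f} dex for a 400x enrichment")
```

Under this assumption a shift of exactly 5 log units corresponds to an enrichment of about 316, and a factor of 400 corresponds to about 5.2 log units, so the quoted ~400 is consistent with a shift of roughly 5 dex.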

The high oxidation state of these rocks 
determined the mineralogy, and therefore the 
geophysical behavior, of their parent bodies 
or the planets these bodies formed. For exam- 
ple, the lower mantle of Earth is composed 
of ~70% bridgmanite and ~20% magnesiowüstite,
two mineral phases with markedly
different rheological properties whose abundances
depend on $f_{\mathrm{O}_2}$. The relative abundances
of these minerals determine the dynamic be- 
havior of the mantle (37). For illustration, the 
influence of oxygen fugacity on the miner- 
alogy of a silicate mantle, and the composi- 
tion of a metal-rich core, can be described by 
the reaction 


$$\mathrm{MgO}^{\text{magnesiowüstite}} + \mathrm{Si}^{\text{core}} + \mathrm{O_2} = \mathrm{MgSiO_3}^{\text{bridgmanite}} \qquad (3)$$


where MgO refers to the Mg component in
mantle magnesiowüstite [(Mg,Fe)O], MgSiO₃
refers to the Mg component (bridgmanite) in
mantle silicate perovskite [(Mg,Fe)SiO₃], and
$\mathrm{Si}^{\text{core}}$ refers to Si in the metal-rich core.
Rearranging the equilibrium constant for the
reaction in Eq. 3, $K_{\mathrm{eq},3}$, shows that the activity
ratio of bridgmanite to magnesiowüstite is
expected to vary with oxygen fugacity

$$\frac{a_{\mathrm{MgSiO_3}}^{\text{bridgmanite}}}{a_{\mathrm{MgO}}^{\text{magnesiowüstite}}} = K_{\mathrm{eq},3}\, a_{\mathrm{Si}}^{\text{core}}\, f_{\mathrm{O}_2} \qquad (4)$$


Equation 4 also illustrates that the Si content 
of the core varies inversely with oxygen fugac- 
ity. The concentrations of Si and other light 
elements in the core likely play a role in driv- 
ing the compositional convection within the core 
that powers Earth’s magnetic field (38, 39), 
which affects a planet’s habitability (40). The 
relative size of the metallic core of a body (or 
even its existence) is also determined by oxygen 
fugacity (41). If the body or bodies that accreted
onto WD 1536+520 were otherwise similar to
Earth or its antecedents, the ΔIW value of −1.37
would result in a planet with an Fe-rich metal
core making up ~20% of the mass of the parent
body. For comparison, the most highly oxidized
bodies we found, with ΔIW ~ −0.6, would
assemble to form a planet with no Fe-rich
metal core.

Our results show that the parent objects that 
polluted these WDs had intrinsic oxidation 
states similar to those of rocks in the Solar 
System. Based on estimates of their mass, the 
bodies accreting onto WDs were either aster- 
oids that represent the building blocks of rocky 
exoplanets, or they were fragments of rocky 
exoplanets themselves (15, 42). In either case,
our results constrain the intrinsic oxygen fugacities
of rocky bodies that orbited the progenitor
star of their host WD. Our data indicate
that rocky exoplanets constructed from these 
planetesimals should be geophysically and 
geochemically similar to rocky planets in the 
Solar System, including Earth.

3D PRINTING

Rapid, large-volume, thermally controlled 3D printing
using a mobile liquid interface

David A. Walker, James L. Hedrick, Chad A. Mirkin


We report a stereolithographic three-dimensional printing approach for polymeric components that 
uses a mobile liquid interface (a fluorinated oil) to reduce the adhesive forces between the interface and 
the printed object, thereby allowing for a continuous and rapid print process, regardless of polymeric 
precursor. The bed area is not size-restricted by thermal limitations because the flowing oil enables 
direct cooling across the entire print area. Continuous vertical print rates exceeding 430 millimeters per 
hour with a volumetric throughput of 100 liters per hour have been demonstrated, and proof-of-concept
structures made from hard plastics, ceramic precursors, and elastomers have been printed.


Additive manufacturing has been a
disruptive force (1-7). Traditionally, it has
been used for prototyping, substantially 
reducing the time from product ideation 
to manufacturing, but it has recently be- 
gun to make notable progress in the manufac- 
turing sector (8, 9). One promising approach 
for the printing of plastic objects is stereo- 
lithography (SLA), which uses photocurable 
liquid resins that can be chemically formulated
to yield materials with a large range of
attractive mechanical properties (10, 11).
Conventional SLA operates by using ultraviolet
(UV) light to cure liquid photoactive resins 
on a vertically moving plate; the result is a 
desired three-dimensional (3D) object con- 
sisting of stacked 2D layers (12). DeSimone 
and co-workers introduced a variation of SLA 
called continuous liquid interface printing, 
which uses oxygen inhibition to create a reac- 
tion “dead layer” (13, 14). This dead layer pre- 
vents adhesion between the emerging part 
and the bottom of the print vat, obviating the 
need to repeatedly mechanically cleave the 
part from the vat (12). This continuous print
approach increases vertical print speeds by 
two orders of magnitude while simultane- 
ously removing material defects intrinsic to 
the aforementioned layer-by-layer lamination 
approaches. However, the polymerization re- 
actions used in SLA photoresins are highly 
exothermic, and at such print speeds, the chal- 
lenge of heat dissipation is daunting (15, 16). 
Moreover, the delivery of gaseous oxygen, a 
thermal insulator, through the print bed to 
create a dead layer limits one to peripheral 
cooling options that cannot rapidly dissipate 
the heat being generated. 
We report a dead layer-free approach to 
rapid SLA printing, HARP (high-area rapid 
printing), which is capable of continuously
printing over large areas and at rapid vertical
print speeds. The printer operates on the
principle of a UV-curable resin floating on a 
bed of flowing immiscible fluorinated oil to 
minimize interfacial adhesion at the build re- 
gion. Aizenberg and co-workers have demon- 
strated the great breadth of dewetting behavior 
possible when using fluorinated liquids em- 
bedded within solid matrices for applications 
in medicine and marine biofouling (17, 18). 
HARP builds on this concept by keeping the 
fluorinated phase in constant motion relative to 
the emerging printed part, further decreasing 
the adhesion forces (i.e., static versus dynamic) 
and generating a solid-liquid slip boundary. 
The oil can be recirculated through a heat ex- 
changer to cool or heat the build region and 
maintain thermostatic control across the entire 
print bed. Additionally, the oil can be continuously
filtered to remove the microparticulate
solids that are generated during the SLA print
process and decrease resolution through light 
scattering (commonly referred to as “clouding”) 
(19). Finally, because HARP does not require an 
oxygen dead layer, it is compatible with both 
oxygen-sensitive and -insensitive ink chemis- 
tries, increasing the scope of applicable resins 
and resulting materials. 

HARP operates through a mobile liquid in- 
terface that creates a shear stress beneath the 
emerging part and results in a slip boundary 
(Fig. 1). The slip boundary allows for the solidified
part to be continuously retracted from
the print interface. Fluorinated oils (perfluoro- 
polyether copolymers, such as Solvay Fomblin Y 
or Chemours Krytox GPL) were chosen for their 
omniphobic properties and higher densities 
relative to that of common SLA resins. Although 
we explored other immiscible liquid systems, 
including densified water and glycerin, neither 
of these aqueous options produced the same 
quality of dewetting behavior with the emerg- 
ing 3D printed part; both resulted in lower- 
quality print production (see fig. S7). A pair of 
inlet and outlet manifolds were fabricated to 
distribute a laminar flow of oil across the print 
bed with a uniform velocity profile (fig. S1). This 
flow profile ensures that the oil layer remains 
optically uniform across the build platform 
(i.e., no turbulence giving rise to optical distor- 
tions at the oil/resin interface) and results in a 
uniform interfacial-shear stress being applied 
to all solidified parts. 

Fig. 1. Flow profile of a mobile interface that enables continuous printing. (A) Scheme of
a 3D printed part emerging from the HARP 3D printer. (B) Velocity profile under
printed part at different flow speeds, demonstrating the presence of a slip
boundary. Colors represent increasing volumetric fluxes q (red, q = 0.21 mm/s; orange,
q = 0.30 mm/s; green, q = 0.44 mm/s; teal, q = 0.56 mm/s; blue, q = 0.66 mm/s; violet,
q = 0.75 mm/s). Open circles are experimental data points from particle-imaging
velocimetry; continuous lines are fits from an analytical model. (C) Scheme inset of
the slip boundary flow profile under the part, with a representative experimentally
observed flow profile.

To confirm our hypothesis of a slip boundary
being central to the HARP printer's operation,
we used particle imaging velocimetry to
analyze the cross-sectional fluid flow profile as
the oil passed beneath a printed part (20). Slip
boundary conditions are notoriously difficult
to observe because of the randomized optical
reflections that occur at the interfaces under
study, resulting in noisy data (21, 22). To rectify
this problem, we fit an analytical model, allowing
for the possibility of either a slip or non-slip
boundary condition, to the center of the flow
profile (i.e., where high-quality data can be
collected). The model allowed for a classical
Navier slip boundary to arise when the interfacial
shear stress exceeded a given critical
shear stress (23). From this analysis, the slip
boundary model best captured the dynamics
of the experimental data (Fig. 1B and fig. S3)
(24). This is best reflected in the experimental
velocity profiles [$v_x(y)$; circles in Fig. 1B] as a
function of the oil flow rate (increasing volumetric
flux, from red to violet) and the corresponding
Navier slip model fits (solid lines
in Fig. 1B).
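
The thresholded Navier slip condition described above can be written compactly. The sketch below is only one textbook way to express such a boundary condition, not the fitted model from the supplementary materials: it assumes a Newtonian oil (shear stress = viscosity × shear rate) and a constant slip length, and every name and number in it is hypothetical.

```python
def navier_slip_velocity(shear_rate, viscosity, slip_length, critical_stress):
    """Slip velocity at the oil/part interface under a thresholded Navier condition.

    shear_rate      -- velocity gradient du/dy at the interface (1/s)
    viscosity       -- dynamic viscosity of the oil (Pa*s), Newtonian ASSUMED
    slip_length     -- Navier slip length b (m), ASSUMED constant
    critical_stress -- shear stress above which slip sets in (Pa)

    Below the critical stress the interface obeys the no-slip condition
    (zero slip velocity); above it, u_slip = b * du/dy.
    """
    shear_stress = viscosity * shear_rate
    if abs(shear_stress) <= critical_stress:
        return 0.0
    return slip_length * shear_rate


# Illustrative numbers only (not taken from the experiments).
print(navier_slip_velocity(10.0, 0.1, 1e-4, 2.0))  # stress 1 Pa: below threshold
print(navier_slip_velocity(50.0, 0.1, 1e-4, 2.0))  # stress 5 Pa: slip occurs
```

In a fit of the kind described, the slip length and critical stress would be free parameters adjusted against the measured velocity profiles.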

In the absence of heat removal, rapid verti- 
cal print speeds of even modestly sized parts 
quickly exceed the smoke point of the resin. 
These extreme temperature swings (ΔT >
120°C) result in variable reactivities across
the print bed, thermal warping/cracking of the 
printed material, and uncontrolled clouding 
that limits lateral resolution; ultimately these 
factors result in a deformed object and failed 
print (fig. S6) (24). To quantify the ability of 
HARP to dissipate heat, we performed ther- 
mal imaging of a small test print (5 cm x 5 cm 
cross-sectional dimensions) in a specially de- 
signed miniature print vat (i.e., low-profile 
walls to enable imaging across the bed). Three 
different flow scenarios were evaluated: (i) 
printing without a mobile interface or active 
cooling (Fig. 2A); (ii) printing with a mobile 
interface to actively dissipate heat, but with 
no cooling of the oil (Fig. 2B); and (iii) printing 
with a mobile interface and active cooling of 
the oil (Fig. 2C). Without flow to actively dis- 
sipate heat, the printed part quickly exceeded 
the sensor range of the infrared (IR) camera at 
150°C, and exceeded the flashpoint of the resin’s 
primary monomer diluent (1,6-hexanediol di- 
acrylate, 113°C). Similar experiments involving 
the generation of parts having larger cross- 
sectional areas (e.g., 20 cm x 20 cm) exceeded 
180°C (as evidenced by an IR noncontact ther- 
mometer gun measurement) in the absence of 
an active flow to dissipate the accumulating 
heat. In the case presented here, the accumu- 
lation of heat (Fig. 2A) led to surface temper- 
atures in excess of 150°C and the part cracked 
during the printing process, causing a physical 
displacement from the stage (observable in 
the final frame of Fig. 2A, indicated by white 
lines overlaid on the part; see also fig. S6). 
Alternatively, the flow of oil depicted in Fig. 2B 
dissipated this heat into an oil reservoir. Over 
time, the temperature of the reservoir increased, 
reducing its effectiveness. Last, in Fig. 2C the
oil was actively cooled while flowing to help
stabilize the reservoir temperature, as well as
the surface temperature of the emerging part,
at 100° to 120°C.

Fig. 2. IR thermal images of an emerging 3D printed part (hard polyurethane acrylate resin; cross
section, 5 cm x 5 cm; vertical print rate, 120 μm/s; optical resolution, 100 μm) under three different
print conditions. (A) Stationary print interface. (B) Mobile interface. (C) Mobile interface with active
cooling. Elapsed time between panels (left to right) is ~500 s; scale bars, 25 mm. Data and thermal color
mapping correspond to movies S1 to S3.

To further evaluate the scalability of HARP 
through cooling, we printed a 38 cm x 61 cm x
76 cm part in 1 hour and 45 min with vol- 
umetric throughput of 100 liters/hour (fig. 
S4). At this scale, the fidelity of the print was 
affected by the resin’s contractility [i.e., the 
volumetric reduction undergone during the 
resin solidification process, common to all 
SLA resins and different for each resin for- 
mulation (25)], because even modest contrac- 
tions of a few percent were amplified into 
observable bowing. In contrast, when smaller 
single parts were produced, this contractility 
became less evident and print fidelity increased 
(Fig. 3C). Full use of the large-scale capabilities 
enabled by HARP will require the develop- 
ment of new SLA resins with lower volumet- 
ric contractility. 
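
The throughput figures quoted above follow from simple arithmetic, which can be checked as below. The sketch assumes that the stated dimensions describe the bounding box of the print and that the 76 cm dimension is the vertical (build) direction; both are assumptions made here, and the function names are hypothetical.

```python
def volumetric_throughput_l_per_h(x_cm, y_cm, z_cm, hours):
    """Bounding-box volume (liters) divided by print time (hours)."""
    volume_liters = x_cm * y_cm * z_cm / 1000.0  # 1 L = 1000 cm^3
    return volume_liters / hours


def um_per_s_to_mm_per_h(rate_um_per_s):
    """Convert a vertical print rate from micrometers/second to millimeters/hour."""
    return rate_um_per_s * 3600.0 / 1000.0


PRINT_HOURS = 1.75  # 1 hour and 45 min

print(volumetric_throughput_l_per_h(38, 61, 76, PRINT_HOURS))  # liters per hour
print(760 / PRINT_HOURS)           # mm per hour if 76 cm is the build height
print(um_per_s_to_mm_per_h(120))   # mm per hour at 120 micrometers per second
```

Under these assumptions the bounding-box throughput is about 100.7 liters per hour and the vertical rate about 434 mm per hour, in line with the quoted 100 liters per hour and with a print rate of 120 μm/s (432 mm per hour).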

By not relying on oxygen quenching, HARP 
is capable of printing both oxygen-sensitive 
and oxygen-insensitive resin chemistries. To 
demonstrate this ability, we printed three dif- 
ferent resins on the HARP 3D printer to gen- 
erate three different classes of materials with 
varied properties: a hard polyurethane acrylate, 
an elastomeric butadiene rubber, and a silicon 
carbide ceramic (see Fig. 4). These resins are a 
mix of resins found in the literature (26, 27), 
obtained from companies, and formulations 
developed for the HARP printer. As with all
SLA systems, each resin must be optimized in
terms of initiators and blockers to perform in 
conjunction with the light engine used and the 
desired vertical print rate. This was optimized 
by altering the photo-initiators used and tuning 
their concentrations to achieve a desired en- 
ergy cure and penetration depth response for 
each resin (24). The hard polyurethane resin 
printed optimally under a vertical print speed
of 120 μm/s. The ceramic resin used thiol-ene
click chemistry, a chemistry known to be relatively
oxygen-insensitive (28, 29), to cross-link
the polymeric resin and produce a “green” part. 
This polymeric silicone can be post-treated in 
a furnace at 1000°C to undergo pyrolysis to 
eliminate the organic components and pro- 
duce a silicon carbide structure (26). Finally, 
the butadiene rubber resin required a slower 
print speed (30 μm/s) owing to its lower
reactivity and higher viscosity. Although the
postprocessing and print conditions differed 
for each resin (Fig. 4) (24), all the resins were 
compatible with the HARP approach. This 
makes HARP printing generally useful with 
stereolithographic resins currently available 
through literature or commercial formula- 
tions (26, 29). 

To determine whether the HARP continuous 
print process produces isotropic materials, 
we performed mechanical testing on dog-bone 
structures printed from an ABS (acrylonitrile 
butadiene styrene)-like urethane acrylate 
resin in multiple orientations. This was done 
using the protocols of ASTM standard D638
for Type I dog bones (30). The results show that
the HARP-printed material maintains isotropic 
mechanical properties comparable to a molded 
part (Fig. 3A) and that these properties are 
independent of print orientation, unlike other 
modes of noncontinuous printing (fig. S8) (24). 
The HARP parts exhibited mechanical proper- 
ties comparable to those of injection-molded 
ABS plastic (Fig. 3A, gray line). In addition, the 
HARP-printed structures could be made with 
high print fidelity (Fig. 3, B and C), resolving 
features as small as 300 μm (the theoretical
optical resolution is 100 μm and is a function
of the light-patterning engine) and achieving a 
volumetric correlation of 93% on a test print.
HARP-printed parts have a surface ridging
that depends on the minimal wall thickness
of the object being printed; thinner part sections
result in faster resin replenishment rates
and consequently yield a smoother surface.
Note that this surface roughness is different
from the discrete lamination layers in parts
produced by traditional SLA [both digital light
processing (DLP) and laser-based approaches]
and is independent of the vertical resolution
of the slice patterns being sent to the
light-patterning engines (10 μm z-height). To further
characterize this phenomenon, we printed
an array of 80 Type IV dog-bone structures of
various thicknesses (1 to 3 mm) and widths
(1 to 6 mm) within the necks of the bones at a
vertical print rate of 120 μm/s. We then characterized
the surfaces via optical microscopy
and profilometry to quantify the magnitude of
the profiles and the arithmetic surface roughness
for each geometry. The maximum roughness,
from peak to valley, was determined to
be within ±100 μm (i.e., ±1 optical pixel) for all
samples (Fig. 3D and fig. S9) (24), with substantially
smoother surfaces for parts with a minimal
dimension not exceeding 1.5 mm (within
±10 μm). The arithmetic surface roughness
strongly correlated with the minimum part dimension
(i.e., width or thickness; Fig. 3E) with
a Pearson correlation coefficient of r = 0.90
(n = 20, $r_{\mathrm{crit}}$ = 0.444 for P = 0.05 representing a
95% confidence level). Additionally, we printed
a series of dog-bone structures (80 dog bones,
five unique geometries in replicate; 120 μm/s
vertical print rate) similar to a Type IV dog
bone in profile, but with various thicknesses
ranging from 1 to 3 mm (a typical Type IV dog
bone is 3.175 mm thick). Although differing
geometries can give rise to slight variations
in tensile properties, our hypothesis was that
despite the varying dog-bone thickness affecting
the surface roughness, the bulk material
properties would be relatively invariant (Fig.
3F). Indeed, the Pearson correlation coefficient
between the surface roughness and maximum
tensile stress is r = −0.34 (n = 25, $r_{\mathrm{crit}}$ = 0.396
for P = 0.05 representing a 95% confidence
level; Fig. 3F, inset), validating this hypothesis.

Fig. 3. Manufacturing-ready materials and resolution.
(A) Type I dog-bone structures made from an
ABS-like polyurethane acrylate resin exhibit isotropic
mechanical properties (ASTM D638) and are
comparable to a part cast from the same resin
(black line) as well as injection molded ABS
(gray line). (B) HARP also enables high spatial
resolution and print fidelity, as evidenced by the
variation between designed and printed features
as a function of feature size down to ~300 μm
(below this, the ability to resolve parts becomes
inconsistent) using a light-patterning engine with
an optical resolution of 100 μm. Data points are
mean values; error bars represent SD across
0 5-mm posts printed in differing regions of the
print bed. Red dashed lines represent the bounding
constraints of ±1 pixel for the light-patterning
engine. (C) A computed tomography (CT) scan
between a printed part (print rate, 120 μm/s;
optical resolution, 100 μm) and its CAD design file
reveals a volumetric correlation of 93%. Blue
and purple volumes represent the printed object
and the CAD object, respectively. Scale bar, 1 cm.
(D) Representative height profile scans along
the print direction for a series of 3-mm-thick dog
bones for varying widths [red, 1 mm; green,
2 mm; blue, 4 mm; black, 6 mm; see additional
data in (24) for dog bones 1 to 2.5 mm thick].
Red dashed lines represent the bounding
constraints of ±1 pixel for the light-patterning
engine. (E) Analysis of the profilometry data allows
for the calculation of the arithmetic surface
roughness, $R_a$, of the printed parts as a function of
minimum feature dimension and reveals a strong
linear correlation with a Pearson correlation
coefficient of r = 0.90 (n = 20, $r_{\mathrm{crit}}$ = 0.444 for
P = 0.05). (F) Despite this surface roughness,
the maximum tensile stress remains invariant
of the feature size (data points are mean values;
error bars represent SD across five Type IV dog
bones, ASTM D638). The inset shows a scatterplot
for each dog bone, revealing a Pearson correlation
coefficient between the surface roughness and
maximum tensile stress of r = −0.34 (n = 25, $r_{\mathrm{crit}}$ = 0.396
for P = 0.05). The hard polyurethane acrylate resin
was used for all experiments in (B) to (F).
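
The critical correlation values quoted above can be reproduced from Student's t distribution using r_crit = t / sqrt(t² + n − 2). The sketch below hard-codes the two-tailed 5% critical t values for 18 and 23 degrees of freedom from standard statistical tables rather than computing them; the function names are hypothetical, and the choice of a two-tailed test is an assumption that happens to match the quoted numbers.

```python
import math

# Two-tailed critical values of Student's t at P = 0.05, keyed by degrees of
# freedom (n - 2). Taken from standard tables, not from the source text.
T_CRIT_05 = {18: 2.101, 23: 2.069}


def critical_r(n):
    """Smallest |r| that is significant at P = 0.05 for n paired samples."""
    df = n - 2
    t = T_CRIT_05[df]
    return t / math.sqrt(t * t + df)


def is_significant(r, n):
    """True if the Pearson coefficient r exceeds the critical value for n."""
    return abs(r) > critical_r(n)


print(round(critical_r(20), 3), is_significant(0.90, 20))
print(round(critical_r(25), 3), is_significant(-0.34, 25))
```

With these table values the thresholds come out at about 0.444 (n = 20) and 0.396 (n = 25), so r = 0.90 is significant whereas r = −0.34 is not, which matches the interpretation given in the text.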

Taken together, these advances in through- 
put, mobile interface design, and materials 
generality solve several problems associated 
with large-area, large-object 3D printing. Sev- 
eral challenges remain, including the devel- 
opment of high-speed optical systems capable 
of retaining high lateral resolution and deliver- 
ing high optical densities through the interface. 
Additionally, stereolithographic systems, which 
operate at high vertical speeds, remain limited
by the availability of low-viscosity and
low-contractility resins, which can be used to generate
structures with industrially relevant
properties.

Fig. 4. A wider palette of resins. (A) A hard,
machinable polyurethane acrylate part (print
rate, 120 μm/s; optical resolution, 100 μm)
with a hole drilled against the print direction.
Traditional noncontinuous layer-by-layer
printing techniques typically delaminate
and fracture when drilled in this orientation.
(B) A post-treated silicon carbide ceramic
printed lattice (print rate of green polymer
precursor, 120 μm/s; optical resolution,
100 μm) stands up to a propane torch
(~2000°C). (C and D) A printed butadiene
rubber structure (print rate, 30 μm/s;
optical resolution, 100 μm) in a relaxed state
(C) and under tension (D). (E) Polybutadiene
rubber (print rate, 30 μm/s; optical resolution,
100 μm) returns to expanded lattice after
compression. (F) A ~1.2-m hard polyurethane
acrylate lattice printed in less than 3 hours
(vertical print rate, 120 μm/s; optical
resolution, 250 μm). Scale bars, 1 cm.

Light-driven deracemization enabled by excited-state
electron transfer

Nick Y. Shin, Jonathan M. Ryss, Xin Zhang, Scott J. Miller*, Robert R. Knowles*


Deracemization is an attractive strategy for asymmetric synthesis, but intrinsic energetic challenges
have limited its development. Here, we report a deracemization method in which amine derivatives
undergo spontaneous optical enrichment upon exposure to visible light in the presence of three distinct
molecular catalysts. Initiated by an excited-state iridium chromophore, this reaction proceeds through
a sequence of favorable electron, proton, and hydrogen-atom transfer steps that serve to break and
reform a stereogenic C-H bond. The enantioselectivity in these reactions is jointly determined by
two independent stereoselective steps that occur in sequence within the catalytic cycle, giving rise to a
composite selectivity that is higher than that of either step individually. These reactions represent a
distinct approach to creating out-of-equilibrium product distributions between substrate enantiomers
using excited-state redox events.


Enantioselective reactions are essential to the pharmaceutical, agrochemical, and fine chemical industries, providing access to products enriched in just one of two mirror-image geometries. Conventional enantioselective methods either transform achiral starting materials into chiral products or rely on kinetic resolutions to differentially transform the stereoisomers of chiral reactants. Both approaches have been the subject of extensive interest and development (1). By contrast, methods for achieving selective deracemization, wherein a racemic mixture of a given compound is wholly transformed into a single enantiomer of the same molecule, are rare, despite their conceptual simplicity and potential practical benefits (Fig. 1A) (2-4).
Two factors complicate the development of 
deracemization methods. First, the conversion 
of a racemic mixture into a single enantiomer 
is unfavorable on thermodynamic grounds 
because of an attendant decrease in entropy. 
Although this effect is small ($\Delta G^{\circ}$ = +0.42 kcal/mol at 298 K), it requires that an additional
source of energy be supplied to drive the reac- 
tion forward. The second challenge is kinetic 
in nature and relates to the principle of micro- 
scopic reversibility (5). As enantiomers are 
equal in energy by definition, any series of ele- 
mentary steps along a single potential sur- 
face that converts (S) to (R) will be equally 
facile in the reverse direction that transforms 
(R) to (S). In the absence of an exogenous driv- 
ing force, this necessarily results in an equi- 
librium (racemic) distribution of products. 
Accordingly, effective deracemizations require 
both an input of energy to impart reaction 
directionality and distinct mechanisms for the elementary steps that respectively create and destroy stereochemistry.
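
The size of the entropic penalty mentioned above can be checked with a short calculation. The sketch below assumes ideal mixing, so that converting a 1:1 racemate into a single enantiomer costs RT ln 2; the function name and the constant are illustrative choices, not taken from the article.

```python
import math

R_KCAL = 1.987204e-3  # molar gas constant in kcal/(mol*K)


def deracemization_free_energy(temp_k: float = 298.0) -> float:
    """Ideal-mixing free-energy cost (kcal/mol) of turning a racemate into one enantiomer."""
    # A 50:50 mixture carries an entropy of mixing of R*ln(2);
    # removing it costs T*R*ln(2) in free energy.
    return R_KCAL * temp_k * math.log(2)


if __name__ == "__main__":
    print(f"{deracemization_free_energy():.2f} kcal/mol at 298 K")
```

Under this assumption the result is roughly 0.41 kcal/mol, close to the +0.42 kcal/mol quoted in the text; the small difference presumably reflects rounding or the constants used.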

Seminal examples from Turner, Toste, Zhou, 
and others have demonstrated that these re- 
quirements can be met through sequential 
redox transformations fueled by chemically 
compatible (or phase separated) oxidants 
and reductants, wherein the oxidation and 
reduction reactions occur independently and 
in parallel (6-8). Although effective, this ap- 
proach can be challenging to generalize and 
requires that two stoichiometric reagents be 
consumed each time a molecule of substrate 
is processed. Excited-state reactions can also 
satisfy these key mechanistic requirements. Be- 
cause they occur across two distinct potential 
energy surfaces, photochemical transforma- 
tions are not subject to detailed balance and 
can provide access to non-Boltzmann product 
distributions—a benefit that underlies the 
success of many classical photoisomerization 
reactions (9, 10). Moreover, from a practical 
perspective, such processes require no chem- 
ical reagents, produce no stoichiometric waste, 
and consume nothing but photons. Bach et al. 
very recently reported groundbreaking ex- 
amples of photo-driven deracemizations of 
allenes and cyclopropylquinolones using a 
chiral photosensitizer that exhibits different 
energy-transfer efficiencies for the two sub- 
strate enantiomers, resulting in high levels 
of optical enrichment (Fig. 1C) (11, 12). These 
and other photoisomerizations are generally 
understood to proceed through the electronic 
excited states of the substrates themselves, 
which strategically defines a substrate-specific 
paradigm (13-15). 

We present here a complementary platform 
for light-driven deracemizations based on the 
use of excited-state redox events (Fig. 1B). 
These electron transfer-based approaches 
provide an alternative mechanism for driving 
reactions in opposition to a thermodynamic 
gradient yet are likely applicable to a wider range of substrates and reaction types than
direct excitation or energy transfer-based 
approaches. We have previously shown that 
excited-state redox events can be used to drive 
out-of-equilibrium reactions, such as intermo- 
lecular olefin hydroaminations and the isom- 
erization of cyclic alcohols to linear ketones, 
wherein the reaction products are higher in 
energy than the starting materials (16, 17). Here, 
we extend these studies and describe a method 
for the light-driven deracemization of cyclic 
ureas mediated by a ternary catalyst system 
comprising an Ir(III)-based photoredox cata- 
lyst, a 1,1'-bi-2-naphthol (BINOL)-derived chi- 
ral phosphate base, and a cysteine-containing 
peptide thiol H-atom donor (Fig. 1D). This 
process occurs through a series of favorable 
electron, proton, and H-atom transfer events 
that serve to break and reform a stereogenic 
methine C-H bond. The extent of optical en- 
richment in these reactions is jointly deter- 
mined by two independent enantioselective 
steps that proceed in sequence within the cat- 
alytic cycle. This results in an unusual (and 
beneficial) outcome in which two modestly 
stereoselective steps together result in an ob- 
served selectivity that is higher than that of 
either individual step. The discovery, optimization, 
scope, and a preliminary mechanistic model 
for this process are presented herein. 

We first observed deracemization behavior 
serendipitously while attempting to develop 
an asymmetric variant of a previously re- 
ported hydroamidation reaction mediated by 
an Ir(III)-based photoredox catalyst, a dialkyl 
phosphate base, and an aryl thiol H-atom do- 
nor under visible light irradiation (Fig. 2A) 
(18). We found that the use of chiral BINOL phosphates as Brønsted bases was effective in this chemistry and resulted in a modest amount of enantioselectivity. However, time-course studies unexpectedly revealed that the urea product 1b was initially formed as a racemate but became slightly optically enriched during the course of the reaction. In subsequent control reactions, we subjected racemic 1b to the reaction conditions and observed significant optical enrichment with near-complete material recovery, indicating that a light-driven deracemization pathway was operative. Similarly, when enantiopure (S)-1b was subjected to identical conditions using an achiral phosphate base, racemization of the stereogenic C-H bond was observed.

On the basis of this discovery, we postulated that the excited state of the Ir photocatalyst reversibly oxidizes the racemic urea substrate to form a mixture of transient (and enantiomeric) arene radical cations (Fig. 2B). The stereogenic C-H bond in the resulting substrate radical cation is markedly acidified and can be deprotonated by the phosphate base to form a neutral α-amino radical (19, 20). However, as both the radical cation and the Brønsted base are chiral, this process serves to kinetically resolve the enantiomeric radical cations, with the fast-reacting (R)-enantiomer undergoing proton transfer while the slower-reacting (S)-enantiomer is converted back to the urea starting material by charge recombination with the reduced Ir(II) state of the photocatalyst. In this way, the reaction becomes enriched in the slower-reacting (S)-enantiomer. After proton transfer, the resulting α-amino radical intermediate can be reduced by H-atom transfer with the achiral aryl thiol cocatalyst to return the closed-shell urea in a nonselective process. A proton-coupled electron transfer (PCET) event among the reduced Ir(II) complex, thiyl, and the protonated base could then return the active forms of all three catalysts.

Fig. 1. Reaction development. (A) Thermodynamic and kinetic challenges in developing methods for selective deracemization. (B) General, light-driven strategies for achieving out-of-equilibrium deracemization through excited-state redox events. (C) Bach's report on light-driven deracemization through selective energy transfer. (D) Light-driven deracemization of cyclic ureas by excited-state electron transfer.

If operational, this mechanism suggests that the two steps that create and destroy stereochemistry in these reactions (proton transfer and H-atom transfer) operate independently of one another and are mediated by two independent catalysts. Accordingly, when both the proton-transfer and H-atom-transfer catalysts are chiral, both elementary steps can potentially be rendered enantioselective, resulting in an unusual circumstance in which the observed stereoselectivity should be the product of the enantiomeric ratios for each of the two enantioselective steps ($\mathrm{er}_{\mathrm{obs}} = \mathrm{er}_{\mathrm{PT}} \times \mathrm{er}_{\mathrm{HAT}}$).
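
The product rule for enantiomeric ratios can be made concrete with a small helper. The sketch below multiplies two ratios component-wise and renormalizes to percent, using the single-catalyst values reported later in this article for the optimized phosphate (86:14) and thiol (79:21); the function name is hypothetical.

```python
def composite_er(er_pt, er_hat):
    """Multiply two enantiomeric ratios (major, minor) and renormalize to percent."""
    major = er_pt[0] * er_hat[0]
    minor = er_pt[1] * er_hat[1]
    total = major + minor
    return 100.0 * major / total, 100.0 * minor / total


if __name__ == "__main__":
    major, minor = composite_er((86, 14), (79, 21))
    print(f"predicted er = {major:.0f}:{minor:.0f}")
```

With these inputs the renormalized product rounds to 96:4, the predicted value quoted in the article.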

To evaluate this hypothesis, we elected to further study the deracemization of N-aryl-substituted cyclic ureas. Preliminary studies demonstrated that a pendant amide H-bond donor group is crucial for obtaining high selectivities in the enantioselective deprotonation step using the chiral phosphate bases (vide infra), prompting us to evaluate the deracemization of urea 2a as a model substrate (Fig. 3A). A small collection of BINOL-derived phosphate bases were explored with [Ir(dF(CF3)ppy)2(bpy)]PF6 (Ir) and an achiral thiophenol H-atom donor catalyst in tetrahydrofuran under irradiation with blue light-emitting diodes (LEDs) at room temperature. Although a BINOL-derived phosphate with 1-adamantyl groups (3a) gave essentially racemic product, an analogous catalyst bearing phenyl groups gave an improved er of 69:31 (3b) (entries 1 and 2). We hypothesized that there might be a stabilizing π-cation interaction between the oxidized substrate radical cation and the aryl substituents of the chiral phosphate, prompting us to examine catalysts bearing more expansive aryl substituents (21). This in turn led to catalyst 3e, which gave 79:21 er and good yield (entries 3 to 5). The addition of molecular sieves further improved the selectivity to 86:14 er (entry 6).

Fig. 2. Discovery of light-driven deracemization. (A) Initial observations. Ir is racemic in all experiments. (B) Postulated mechanism.

We then investigated cysteine-based oligo- 
peptides as enantioselective H-atom-transfer 
catalysts. Although cysteine residues are known 
to mediate H-atom-transfer reactions in a vari- 
ety of biological contexts, they have only occa- 
sionally been explored for use in small-molecule 
asymmetric catalysis, and their study for cat- 
alytic asymmetric H-atom transfer is not yet 
reported (22-25). A small library of tetrapep- 
tide disulfides (which are in equilibrium with 
their free thiol form under the reaction con- 
ditions) were initially screened with a cata- 
lytic amount of achiral tetrabutylammonium 
diphenylphosphate base, providing a lead re- 
sult of 68:32 er with peptide 4b (entries 7 to 9). 
The corresponding thiol 4e gave a slightly improved er of 70:30 (entry 10). Variations at the i+2 and i+3 positions demonstrated that cysteine-embedded tetrapeptides with phenylglycine as the C-terminal residue show improved er, leading to 4e with 78:22 er and good yield (entries 11 and 12). The use of molecular sieves slightly improved the selectivity to 79:21 er (entry 13). With optimized chiral phosphate 3e and chiral thiol 4e in hand, our mechanistic hypothesis predicted that reactions mediated by the stereochemically matched pair of catalysts should result in an observed er of 96:4 (86:14 × 79:21 = 96:4) (26). Reaction of rac-2a
with 3e, 4e, Ir, and molecular sieves indeed 
produced optically enriched 2a, with a com- 
posite selectivity of 93:7 er (entry 14) upon ir- 
radiation with blue LEDs. We speculated that 
the modest deviation from the predicted er 
value might result from a thiyl-mediated race- 
mization pathway that may become operative 
at high levels of optical enrichment (27). We 
postulated that inclusion of an alternative 
H-atom donor with a much weaker C-H bond 
might react preferentially with the alkyl thiyl 
radical and suppress any undesired racemiza- 
tion. Indeed, the addition of 50 mole % (mol %) 
triphenylmethane to the optimal conditions 
improved the er to the expected value of 96:4 
(entry 15). Further investigation revealed that 
the use of lower catalyst loadings was also 
effective (entry 16). Control reactions revealed 
that exclusion of light or any of the reaction 
components resulted in either complete loss 
of optical enrichment or diminished levels of 
enantioselectivity (table S1).

With these optimized deracemization conditions in hand, we found that a variety of
structural changes in the urea substrate could 
be accommodated (Fig. 3B). Alkyl substituents 
of varying steric demand could be tolerated at 
the stereogenic carbon with uniformly good 
levels of enantioselectivity (2a to 2e). Sub- 
stitution of the distal urea nitrogen with the 
free N-H amide, benzyl, or isopropyl groups 
also provided the desired deracemized prod- 
ucts with high levels of stereoselectivity (2f 
to 2h). Structural changes on the acyclic amide 
moiety were also tolerated, as was a benzamide 
derivative (2i to 2n). However, an N,N-dimethyl amide variant demonstrated a noticeable decrease in the er (2o). Upon reaching the steady-state level of optical enrichment, all of the substrates studied here (2a to 2o) can be recovered in nearly quantitative yield.

Numerous observations are consistent with the mechanistic proposal outlined above (Fig. 4A). Steady-state Stern-Volmer quenching studies and time-correlated single-photon counting experiments revealed that electron transfer between the urea substrate ($E_{p/2}$ = 0.91 V versus Fc$^{+}$/Fc in MeCN; fig. S1) and the excited state of Ir ($^{*}E_{1/2}$ = 0.94 V versus Fc$^{+}$/Fc in MeCN) (28) is kinetically rapid [the rate constants for electron transfer events ($k_{\mathrm{ET}}$) for (R)-2a and (S)-2a are 9.0(8) × 10^[?] M$^{-1}$ s$^{-1}$ and 8.8(7) × 10^[?] M$^{-1}$ s$^{-1}$, respectively; fig. S2]. The subsequent proton transfer, H-atom transfer, and PCET steps are also thermodynamically favorable (fig. S3). These findings reinforce the notion that, as all the elementary steps proceeding from *Ir are exergonic, the observed product distributions are kinetically controlled and fully decoupled from the energetic difference between the racemic starting material and the optically enriched product. The steady-state er is achieved within only 1.5 hours, and no degradation of either yield or er was observed upon extended reaction times, suggesting that the system establishes a stable nonequilibrium state (Fig. 4B). The quantum yield of this process was measured to be 4.8(3)% (fig. S5) (29). We also found that catalyst-controlled stereoinversion from optically pure (S)-2a to (R)-2a could be achieved under the optimized conditions with excellent er and reaction efficiency (Fig. 4C).

Fig. 3. [...] conditions. Reactions were performed on a 0.025 mmol scale. Yields were determined by $^{1}$H-nuclear magnetic resonance (NMR) [...] mixtures relative to an internal standard. The er was determined by high-performance liquid chromatography (HPLC) analysis on a chiral stationary phase. *10% m/v of molecular sieves (MS). †5 mol % of dis[...] (B) Reaction scope. Reactions were run at a 0.25 mmol scale unless otherwise noted. Yields and er values are for isolated material after purification and are the average of two experiments. In parentheses are yields and er's obtained on a 0.025 mmol scale analyzed by $^{1}$H-NMR and HPLC analysis, in which the internal reaction temperature was measured to be 25°C. §Reaction scale, 0.10 mmol. ¶Reaction time, 12 hours. #Reaction scale, 0.025 mmol, NMR yield.

To investigate the generality of the proposed synergistic stereoselectivity between the two chiral catalysts, the enantioselectivity of each catalyst was explored individually for a selected set of substrates (Table 1). First, substrates were subjected to deracemization with chiral base 3e and thiophenol as an achiral H-atom transfer catalyst, conditions that directly report on the er of the proton-transfer step. The observed er was generally consistent for substrates bearing a pendant amide group (entries 1 to 10). Methylation of the amide decreased the er (entries 11 and 12), which supports a potential interaction between the distal amide N-H bond and the phosphate base during the asymmetric proton-transfer step. The same substrates were then deracemized with cysteine-embedded peptide 4e and tetrabutylammonium diphenylphosphate as an achiral Brønsted base catalyst, providing a measure of the selectivity in the H-atom transfer (HAT) step. Although substitution of the urea backbone led to small variations, the enantioselectivity in the HAT step is similar for all substrates with or without the pendant amide group. When the stereochemically matched forms of both chiral catalysts were used [e.g., the phosphate selectively ablates the (S) enantiomer and the thiol preferentially reforms the (R) enantiomer], the resultant er value closely matched the predicted value in all cases. Similarly, the combination of mismatched thiol ent-4e with base 3e provided diminished er for 2a, indicating conflicting stereochemical preferences wherein the deprotonation and HAT events both favor the (S) enantiomer (entry 2). These observations are consistent with the proposed mechanism and highlight the synergistic role of the two chiral catalysts in this transformation.

Fig. 4. Preliminary mechanistic studies. (A) Free-energy profile of light-driven deracemization from rac-2a to (R)-2a. Details are included in the supplementary materials (fig. S3). (B) Time-course studies for deracemization of rac-2a to (R)-2a. (C) Selective stereoinversion of (S)-2a to (R)-2a.

Table 1. Studies on synergistic enantioselectivity of each chiral catalyst and synergistic stereoselectivity.

Entry | Substrate | Experimental er ($\mathrm{er}_{\mathrm{PT}}$), chiral base only | Experimental er ($\mathrm{er}_{\mathrm{HAT}}$), chiral thiol only | Predicted er ($\mathrm{er}_{\mathrm{PT}} \times \mathrm{er}_{\mathrm{HAT}}$) | Experimental er ($\mathrm{er}_{\mathrm{obs}}$), chiral base + chiral thiol

1b 49:51 76:24 74:26

Reaction conditions: 2 mol % Ir, 10 mol % 3e, 10 mol % PhSH, MS, THF, blue LEDs, 25°C, 4 hours (column 3); 2 mol % Ir, 10 mol % NBu4+ (PhO)2P(O)O−, 10 mol % 4e (or ent-4e), 50 mol % Ph3CH, MS, THF, blue LEDs, 25°C, 4 hours (column 4); 2 mol % Ir, 5 mol % 3e, 5 mol % 4e (or ent-4e), 25 mol % Ph3CH, MS, THF, blue LEDs, 25°C, 4 hours (column 6). The reaction yields in all cases are >90%. Detailed experimental results are included in the supplementary materials (table S7). $\mathrm{er}_{\mathrm{PT}}$, proton-transfer er; $\mathrm{er}_{\mathrm{HAT}}$, H-atom-transfer er; $\mathrm{er}_{\mathrm{obs}}$, observed er.
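
One way to see why two sequential selective steps multiply is a toy kinetic model. In the sketch below, each enantiomer is consumed at its own rate (the proton-transfer selectivity) and the resulting planar radical is returned to (R) or (S) with fixed probabilities (the H-atom-transfer selectivity). All rate constants, the time step, and the function name are hypothetical and in arbitrary units, and the radical is treated as an instantly quenched intermediate; this is an illustration of the product rule, not the kinetic scheme fitted in the article.

```python
def steady_state_fraction_r(k_s=0.86, k_r=0.14, f_r=0.79, f_s=0.21,
                            dt=0.01, steps=20000):
    """Integrate a two-state cycle and return the final mole fraction of (R)."""
    r = s = 0.5  # start from the racemate
    for _ in range(steps):
        # Total rate at which either enantiomer is converted to the radical.
        flux = k_s * s + k_r * r
        # Each enantiomer is depleted by its own rate and refilled by its
        # share of the radical flux (explicit Euler step).
        s += (-k_s * s + f_s * flux) * dt
        r += (-k_r * r + f_r * flux) * dt
    return r / (r + s)


if __name__ == "__main__":
    print(f"matched pair:    (R) fraction = {steady_state_fraction_r():.3f}")
    print(f"mismatched pair: (R) fraction = "
          f"{steady_state_fraction_r(f_r=0.21, f_s=0.79):.3f}")
```

At steady state the model satisfies [R]/[S] = (k_s/k_r)(f_r/f_s), so the matched case reproduces the multiplicative prediction, while inverting the thiol preference lowers the enrichment, qualitatively matching the mismatched-catalyst observation.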

We anticipate that the mechanistic features 
underlying this work are general and may be 
adapted to a wide variety of other light-driven 
transformations to provide nonequilibrium 
product distributions in a catalyst-controlled 
fashion.

QUANTUM COMPUTING

Deterministic generation of a two-dimensional cluster state


Mikkel V. Larsen*, Xueshi Guo, Casper R. Breum, Jonas S. Neergaard-Nielsen, Ulrik L. Andersen* 


Measurement-based quantum computation offers exponential computational speed-up through simple 
measurements on a large entangled cluster state. We propose and demonstrate a scalable scheme for 
the generation of photonic cluster states suitable for universal measurement-based quantum 
computation. We exploit temporal multiplexing of squeezed light modes, delay loops, and beam-splitter 
transformations to deterministically generate a cylindrical cluster state with a two-dimensional (2D) 
topological structure as required for universal quantum information processing. The generated state 
consists of more than 30,000 entangled modes arranged in a cylindrical lattice with 24 modes on the 
circumference, defining the input register, and a length of 1250 modes, defining the computation depth. 
Our demonstrated source of two-dimensional cluster states can be combined with quantum error 
correction to enable fault-tolerant quantum computation. 


Quantum computing represents a new paradigm for information processing that harnesses the inherent nonclassical features of quantum physics to find solutions to problems that are computationally intractable on classical processors (1). In measurement-based, or cluster state, quantum computing (MBQC), the processing is performed by simple single-site measurements on a large entangled cluster state (2). This constitutes a simplification over the standard gate-based model of quantum computing, as it replaces complex coherent unitary dynamics with simple projective measurements. However, one of the outstanding challenges in realizing cluster state computation is the reliable, deterministic, and scalable generation of nonclassical entangled states suitable for universal information processing.

Several candidate platforms for scalable cluster state generation have been proposed and some experimentally realized, including solid-state superconducting qubits (3), trapped ion qubits (4, 5), and photonic qubits or qumodes, in which qubits can be encoded, generated by parametric down-conversion (6-9) or by quantum dots (10). However, none of these implementations has demonstrated true scalability combined with computational universality. The largest cluster state generated to date is a temporally multiplexed photonic state comprising entangled modes in a long chain, which, however, does not allow for universal computation because of its one-dimensional (1D) topological structure (8, 11). To achieve universality, the dimension of the cluster state must be at least two. Several schemes for generating 2D cluster states in different systems have been proposed (12-15), but owing to technical challenges, scalable and computationally universal cluster states have yet to be produced in any physical system.

We propose and demonstrate a highly scal- 
able scheme for the generation of cluster states 
for universal quantum computation based on 
quantum continuous variables (CVs) where 
information is encoded in the position or mo- 
mentum quadratures of photonic harmonic 
oscillators (16). We use a temporally multiplexed 
source of optical Einstein-Podolsky-Rosen (EPR) 
states (17) to generate a long string of entangled 
modes that is curled up and fused to form a 2D 
cylindrical array of entangled modes. Specifi- 
cally, we generate a massive cluster state of 
more than 30,000 entangled modes compris- 
ing an input register of 2 x 12 = 24 modes on 
which the input state may be encoded, and a 
length of 1250 modes for encoding operations 
by projective measurements, limited only by 
the phase stability of our setup. In addition to 
being universal and deterministically gene- 
rated, the source is operated under ambient 
conditions in optical fibers at the low-loss tele- 
com wavelength of 1550 nm. These favorable 
operational conditions and specifications facil- 
itate further upscaling of the entangled state, 
as well as its use in applications and funda- 
mental studies. 

The canonical approach to CV cluster state generation is to apply two-mode controlled-Z gates onto pairs of individually prepared eigenstates of the momentum (or phase quadrature) operators $\hat{p}_i$, $\hat{p}_j$ in adjacent modes $i$, $j$. The gate is described by the unitary operation $\hat{C}_Z = e^{i g \hat{x}_i \hat{x}_j}$, where $\hat{x}_i$, $\hat{x}_j$ are the position (amplitude quadrature) operators of modes $i$ and $j$, and $g$ is the interaction strength. Applying this gate to two modes leads to entanglement in the form of quantum correlations of the two modes' quadratures. The operations and resulting state can be represented by a graph in which the nodes represent the momentum eigenstates and the edges (links) between the nodes represent the application of a controlled-Z operation, where the interaction strength is given by the edge weight. In a practical implementation, the unphysical momentum eigenstates are replaced by highly squeezed states, while the controlled-Z operations can be imitated by phase shifts and beam-splitter transformations. To enable scalability, it has been suggested that multiplexing of spatial modes (18), frequency modes (19, 20), or temporal modes (13, 21) be used. For example, Menicucci suggested using temporal multiplexing to form a 2D cluster state combining four squeezed state generators, five beam splitters, and two delay lines (13).
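
The effect of the controlled-Z gate on the quadratures can be sketched numerically. Because the gate is Gaussian, it acts on the quadrature operators by a linear map; with the convention $\hat{C}_Z = e^{i g \hat{x}_i \hat{x}_j}$ and $\hbar = 1$ (an assumption of this sketch), positions are unchanged and each momentum picks up $g$ times the other mode's position. The example below applies that map to classical phase-space points standing in for the operators; the function names are hypothetical.

```python
def apply_cz(mode_i, mode_j, g=1.0):
    """Heisenberg-picture action of C_Z = exp(i g x_i x_j) on (x, p) pairs, hbar = 1."""
    x_i, p_i = mode_i
    x_j, p_j = mode_j
    # Positions are unchanged; each momentum gains g times the other position.
    return (x_i, p_i + g * x_j), (x_j, p_j + g * x_i)


def nullifier(mode_a, mode_b, g=1.0):
    """Combination p_a - g * x_b, which is zero after the gate if p_a was zero before."""
    return mode_a[1] - g * mode_b[0]


if __name__ == "__main__":
    # Zero momentum mimics idealized momentum eigenstates; positions are arbitrary.
    before_i, before_j = (0.3, 0.0), (-1.2, 0.0)
    after_i, after_j = apply_cz(before_i, before_j, g=1.0)
    print(nullifier(after_i, after_j), nullifier(after_j, after_i))
```

The vanishing combinations $\hat{p}_i - g\hat{x}_j$ are the simplest instance of the cluster-state nullifiers used for state verification later in the article.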

We propose a simpler approach to 2D cluster state generation that lowers the experimental requirements (Fig. 1). The state is produced in four steps: (i) Pairs of squeezed vacuum states are generated at 1550-nm wavelength from two bowtie-shaped optical parametric oscillators (OPOs) by parametric down-conversion (22). The states are defined in consecutive temporal modes of duration $\tau$ of the continuously generated OPO output. (ii) The squeezed vacuum pairs in spatial modes A and B are interfered on a balanced beam splitter (denoted BS$_1$). This produces a train of pairwise EPR-entangled temporal modes that exhibit quantum correlation between the position and momentum quadratures. Each EPR pair can be represented by a simple graph of a single edge connecting two nodes. (iii) A 1D cluster state is formed by delaying one arm of the interferometer by $\tau$ with respect to the other arm and interfering the resulting time-synchronized modes on another balanced beam splitter (denoted BS$_2$). The interference entangles EPR pairs along an indefinitely long chain, creating a 1D graph. (iv) In the final step, the 2D cluster state is produced by introducing another delay to one interferometer arm of duration $N\tau$ and interfering the resulting time-synchronized modes on a final beam splitter (denoted BS$_3$). This effectively curls up the graph and fuses the modes into an indefinitely long cylinder with $N$ nodes on the circumference, as illustrated in Fig. 1 for $N$ = 12, leading to 2 × $N$ = 24 input modes distributed on the two spatial modes A and B. For a detailed description of experimental implementations, see materials and methods (23).
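
The two delays set the geometry of the state, which can be checked from the fiber lengths quoted in the Fig. 1 caption (50.5 m and 606 m). The sketch below assumes a group index of 1.468 for the fiber near 1550 nm, a typical value for silica fiber that is not stated in the article, and uses hypothetical function names.

```python
C_VACUUM = 299_792_458.0  # speed of light in vacuum, m/s
GROUP_INDEX = 1.468       # ASSUMPTION: typical group index of silica fiber near 1550 nm


def fiber_delay_ns(length_m, group_index=GROUP_INDEX):
    """Propagation delay of a fiber of the given length, in nanoseconds."""
    return length_m * group_index / C_VACUUM * 1e9


def cluster_geometry(short_m=50.5, long_m=606.0, temporal_modes=15_000):
    """Derive mode duration, circumference, and mode counts from the two delays."""
    tau_ns = fiber_delay_ns(short_m)
    n_circumference = round(fiber_delay_ns(long_m) / tau_ns)
    return {
        "tau_ns": tau_ns,
        "N": n_circumference,
        "input_modes": 2 * n_circumference,            # two spatial modes, A and B
        "total_modes": 2 * temporal_modes,
        "measurement_time_ms": temporal_modes * tau_ns * 1e-6,
    }


if __name__ == "__main__":
    for key, value in cluster_geometry().items():
        print(key, value)
```

Under the assumed group index, the short delay comes out close to the 247-ns mode duration, the ratio of the two delays gives N = 12, and 15,000 temporal modes take a little under 4 ms, consistent with the figures quoted in the article.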

All states and operations involved are Gaussian, meaning they can be described by Gaussian distributions of the quadrature variables in phase space. In the formalism of graphical calculus for Gaussian states (24), the generated graphs are so-called H-graphs, as they can be generated from vacuum by a single Hamiltonian and have an edge weight of $g = i \sinh(2r) G$, where $r$ is the squeezing parameter of the two squeezing operations and $G$ = −1 for the EPR states, +1/2 for the 1D graph, and +1/4, 1/2 for the 2D graph. Owing to the particular structure of the H-graph generated here [it is self-inverse and bipartite; see supplementary text section 1.1 for details (23)], it can be transformed into a cluster state by π/2 rotations in phase space, leading to real edges of weight $g = \tanh(2r) G \to G$ for $r \to \infty$. Finally, as the π/2 phase space rotations can be absorbed into the measurement basis, or simply by appropriate redefinitions of quadratures on the rotated modes, the generated H-graph state and its corresponding cluster state are completely equivalent. [See supplementary text section 1.2 for details on the cluster state generation scheme (23).]
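
How quickly the edge weight $\tanh(2r)G$ approaches its ideal value $G$ can be illustrated by expressing $r$ through the squeezing level in decibels. The sketch below assumes the common convention that a squeezing of $s$ dB corresponds to a quadrature variance reduced by $e^{-2r} = 10^{-s/10}$; the function names and the sample squeezing levels are illustrative, not values from the article.

```python
import math


def squeezing_parameter(squeezing_db):
    """Squeezing parameter r for a variance reduction exp(-2r) = 10**(-dB/10)."""
    return squeezing_db * math.log(10) / 20.0


def cluster_edge_factor(squeezing_db):
    """tanh(2r): the factor multiplying G in the cluster-state edge weight."""
    return math.tanh(2.0 * squeezing_parameter(squeezing_db))


if __name__ == "__main__":
    for level_db in (3, 6, 10, 15):
        print(f"{level_db:>2} dB squeezing: tanh(2r) = {cluster_edge_factor(level_db):.4f}")
```

The factor rises monotonically toward 1 with increasing squeezing, which is the sense in which the edge weights tend to $G$ in the infinite-squeezing limit.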

The produced cylindrical 2D cluster state 
can be shown to be a universal resource for 
quantum computing: In Fig. 2, the gener- 
ated cylindrical cluster state is unfolded and 
projected into a square lattice by projective 
measurements in the position basis and n/2 
phase-space rotations of different modes. Such 
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Fig. 2. Universality of the generated 2D cluster state. (A) Graph of the 
generated 2D cluster state. Measuring the nodes marked by red in the position 


basis removes all edges connected to the measured 


graph unfolds to a plane. (B) Resulting plane 2D cluster state after the projective 
measurements in (A), consisting of two bilayer square lattices (double BSL) 
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a square lattice is a well-known universal re- 
source for quantum computing (25), and thus 
the initial cylindrical cluster state is itself uni- 
versal. For computation, it is not necessary to 
project the generated cluster state into a square 
lattice—rather, one would in general optimize 
the detector settings required for the gate to 
be implemented. For instance, with proper set- 
tings, the cluster state can be projected into 1D 
dual-rail wires along the cylinder, an efficient 
resource for one-mode computation (8, 27) and 
with possible two-mode interactions between 
them [for details, see supplementary text sec- 


the measurement bases in between temporal 
modes, whereas in this work, the cluster state is 
measured in fixed bases for state verification. 
Multipartite cluster state inseparability can 
be witnessed through the measurement of the 
uncertainties of the state nullifiers—linear com- 
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tion 1.4 (23)]. Doing so requires fast control of 


binations of position and momentum operators 
for which the cluster states are eigenstates with 
eigenvalue 0. For example, for the ideal two- 
mode EPR state, the well-known nullifiers 
are Nipp = L4 — Lg and NEpp = D4 + Dz be- 
cause ftpp |EPR) = 0 and 7p, |EPR) = 0. For 
our 2D cluster state, |2D), the nullifiers consist 
of eight modes and are given by 


*B 
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as 7;|2D) =0 and nP|2D) = 0 [derived in 
supplementary text section 1.3 (23)], where 
the subscript indicates the temporal mode 
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homodyne detectors (HD, and HDg), the temporal mode quadratures are 
measured, from which the nullifiers are calculated. In the experimental 
implementation, the short delay is a 50.5-m fiber leading to temporal modes of 
247-ns duration, whereas the long delay is a 606-m fiber such that N = 12, as 
shown in the illustrated graph. The temporal modes are defined by an 
asymmetric-shaped temporal mode function within the 247-ns duration, which 
filters out low-frequency noise and leads to less than 10°? mode overlap (11). 
For more information, see material and methods (23). 


— > D Universal SL 


Time 
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connected by edges of weight 1/2. (C) Single BSL after projective measurement 
of half the modes in (B) in the position basis. (D) Square lattice (SL) after 
projective position measurements of all modes in spatial mode B (dark nodes), 
and applying the Fourier gate (x/2 phase delay) on half the modes in spatial mode 
A (bright nodes). This SL is a traditional universal resource state for MBQC. 
Fig. 3. Experimental result. On the right graph, the nullifiers in Eqs. 1 and 2 are shown on the 2D cluster state
lattice, with the measured variance of 1500 consecutive nullifiers shown in the left plot. Here, the variance is
calculated from 10,000 measurements of each nullifier. All nullifier variances are seen to be well below the -3-dB
inseparability bound derived in supplementary text section 2 (23), and thus the generated cluster state is
completely inseparable. In the inset, the nullifier variance of a larger data set with 2 × 15,000 = 30,000 modes
is shown. Again, with all modes below the -3-dB inseparability bound, we conclude the successful generation of a
30,000-mode 2D cluster state. The rapid increase of the variance in $\hat{n}^{p}_{k}$ and its periodic variation are caused by
phase fluctuation of the squeezing sources, as described in supplementary text section 4 (23).


index and N is the number of temporal modes
in the cluster state circumference. 

The practically realizable cluster state is never 
an exact eigenstate of the nullifiers, because 
such a state is unphysical. The measurement 
outcomes of the nullifiers are therefore not 
exactly zero in every measurement but possess 
some uncertainties around zero. A condition 
for complete inseparability of the 2D cluster 
state [derived in supplementary text section 
2 (23)] leads to a bound on the variances of all 
nullifiers of 3 dB squeezing below the shot noise 
level. Therefore, to witness full inseparability, 
we must observe more than 3-dB squeezing 
for all nullifiers. In Fig. 3, the measured nul- 
lifier variances are shown for a dataset of 
1500 nullifiers, and they are all observed to be 
well below the -3-dB bound; we measure an
averaged variance of -4.7 and -4.3 dB for $\hat{n}^{x}_{k}$
and $\hat{n}^{p}_{k}$, respectively. In the inset of Fig. 3, we
present the measurement of a longer cluster 
state of 15,000 temporal modes, which corre- 
sponds to a measurement time of 4 ms. Al- 
though phase instabilities are clearly seen to 
affect the performance in terms of variations 
of the nullifier variances, all variances stay 
below the -3-dB bound. The 2D cluster of 2 ×
15,000 = 30,000 modes is thus fully inseparable.
Not all 30,000 modes of the cluster state need 
to exist simultaneously when performing pro- 
jective measurements for computation. Indeed, 
only a single temporal mode of the cluster state 
needs to exist while the remaining modes of 
the state are under construction. Hence, the
cluster state can be immediately consumed 
for computation while being generated, with 
no additional state storage necessary (see sup- 
plementary text section 1.4 for a possible mea- 
surement scheme for computation on the cluster 
state). 
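
The inseparability test described above reduces to comparing each nullifier variance, expressed in dB relative to shot noise, against the -3-dB bound. A minimal Python sketch of that comparison follows. The function names are hypothetical, and the two sample values are the averaged variances quoted in the text, not the raw data set.

```python
import math


def variance_db(nullifier_var, shot_noise_var):
    """Nullifier variance in dB relative to the shot-noise variance."""
    return 10.0 * math.log10(nullifier_var / shot_noise_var)


def all_below_bound(variances_db, bound_db=-3.0):
    """True if every nullifier variance lies strictly below the bound."""
    return all(v < bound_db for v in variances_db)


# Averaged variances quoted in the text for the two nullifier types (dB).
averaged = [-4.7, -4.3]
print(all_below_bound(averaged))  # True

# Duration of 15,000 temporal modes of 247 ns each, in milliseconds.
duration_ms = 15_000 * 247e-9 * 1e3
print(f"{duration_ms:.2f} ms")  # 3.71 ms, consistent with the quoted ~4 ms
```

A real analysis would apply `all_below_bound` to every measured nullifier rather than to the averages, because the witness requires all variances to be below the bound.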

With the deterministic generation of a uni- 
versal 2D cluster state, we have [in parallel with 
Asavanant et al. (26)] constructed a platform
for universal MBQC. Its scalability was dem- 
onstrated by entangling 30,000 optical modes 
in a 2D lattice that includes 24 input modes and 
allows for a computation depth of 1250 modes. 
Because only a few modes exist simultane- 
ously, we are not limited by the coherence 
time of the light source, and thus the number 
of operations depends only on the phase sta- 
bility of the system. Unlimited computational 
depth can therefore be achieved by imple- 
menting continuous feedback control of the 
system for phase stabilization, as demon- 
strated for the 1D photonic cluster state in 
(11). The results presented here and in (26) are 
similar: Both 2D cluster states are generated 
deterministically in the CV regime with com- 
parable size and amount of squeezing in the 
nullifier variance. However, with only two 
squeezing sources, three interference points, 
and operation in fiber, the experimental setup 
demonstrated here is simpler, whereas in (26), 
larger-bandwidth OPOs are demonstrated that 
result in shorter delay lines. In both systems, 
the number of input modes can be readily increased
by using OPOs with larger bandwidths,
possibly combined with a longer time delay of 
the second interferometer. For example, using 
OPOs with a 1-GHz bandwidth (65 times wider) 
and an interferometer delay that is twice as 
long, a state with ~1500 input modes can be 
generated. Large-bandwidth OPOs have been 
demonstrated, but phase stability and losses in 
the delay lines are more challenging. Although 
phase fluctuation depends only on experimen- 
tal control, which we expect to improve with 
continuous phase stabilization, delay losses are 
unavoidable, and increasing the OPO band- 
width may be a better solution than increasing 
the delay lengths. 

CV cluster states are described by Gaussian 
statistics, but it is known that an element (state, 
operation, or measurement) of non-Gaussian 
quadrature statistics is required for universal 
quantum computing (27). Such an element could 
be a photon number-resolving detector (PNRD) 
or an ancillary cubic-phase state (27, 28). Despite
recent experimental efforts in developing
high-efficiency PNRD (29) and deterministically 
generating optical states with non-Gaussian 
statistics (30), the formation of the required 
non-Gaussianity of the cluster state still con- 
stitutes an important challenge to be tackled 
in the future. Another currently limiting factor 
to achieving quantum computation is the exis- 
tence of finite squeezing in the cluster, leading 
to excess quantum noise and thus computational 
errors. However, these errors can be circum- 
vented using Gottesman-Kitaev-Preskill (GKP) 
state encoding (28) concatenated with tradi- 
tional qubit error correction schemes, leading 
to fault-tolerant computation with a 15- to 17-dB 
squeezing threshold (31). Another recently discovered
advantage of the GKP encoding is that
in addition to fault tolerance, it also allows for 
universality without adding extra non-Gaussian 
states or operations (32). Although GKP states 
have recently been produced in the microwave 
regime (33) and in trapped-ion mechanical oscil- 
lators (34), their production in the optical regime 
remains a task for future work. [For further 
discussion on quantum computation using the 
generated cluster state, see supplementary text 
section 1.4 (23).] Although a path toward fault- 
tolerant universal quantum computing using 
CV cluster states has been established, it is 
highly likely that the first demonstrations of 
CV quantum computation will be nonuni- 
versal algorithmic subroutines such as boson 
sampling and instantaneous quantum com- 
puting (35). With the large, but noisy, cluster 
state demonstrated here, interesting future 
work will be to implement basic Gaussian 
circuits and investigate, for example, the at- 
tainable circuit depth. Furthermore, the tech- 
nique of folding a 1D cluster state into a 2D 
structure could be extended using an addi- 
tional interferometer to form 3D cluster states, 
which might be suitable for topologically protected
MBQC.
QUANTUM COMPUTING


Generation of time-domain-multiplexed 
two-dimensional cluster state 


Warit Asavanant, Yu Shiozawa, Shota Yokoyama, Baramee Charoensombutamon, Hiroki Emura,
Rafael N. Alexander, Shuntaro Takeda, Jun-ichi Yoshikawa, Nicolas C. Menicucci,
Hidehiro Yonezawa, Akira Furusawa*


Entanglement is the key resource for measurement-based quantum computing. It is stored in quantum 
states known as cluster states, which are prepared offline and enable quantum computing by means 
of purely local measurements. Universal quantum computing requires cluster states that are both 
large and possess (at least) a two-dimensional topology. Continuous-variable cluster states—based on 
bosonic modes rather than qubits—have previously been generated on a scale exceeding one million 
modes, but only in one dimension. Here, we report generation of a large-scale two-dimensional 
continuous-variable cluster state. Its structure consists of a 5- by 1240-site square lattice that was 
tailored to our highly scalable time-multiplexed experimental platform. It is compatible with bosonic
error-correcting codes that, with higher squeezing, enable fault-tolerant quantum computation. 


Quantum computers promise applications beyond what is possible with their
classical counterparts (1). Recent work has produced advances in quantum computing
with stationary matter qubits such as superconducting systems (2), ion trap
systems (3), and silicon-based systems (4). With
access to both high-quality qubits and high- 
fidelity quantum gates, it is thus believed that a 
small-scale quantum computer is within reach. 
In order to bring these platforms to the scale at 
which they become useful, efforts [for example, 
(5)] have been made to address the difficulties 
of preparing, interfacing, addressing, and tuning 
large numbers of qubits. However, a techno- 
logical leap will be necessary to achieve a 
scalable quantum computer along these lines. 
In the light of this, we sought a more direct 
path to scalability. Rather than sequentially 
preparing and interfacing qubits, one can 
prepare a large-scale quantum resource state 
whose constituent quantum systems (such as 
qubits or modes) are interconnected in ad- 
vance. Provided that this resource—known as 
a cluster state (6)—has the required structure 
and scale, then quantum computing can pro- 
ceed by means of purely local (single-site) quan- 
tum measurements. This paradigm is known 
as measurement-based quantum computation 
(MBQC) (7, 8). The scale of the cluster state 
typically determines the number of possible
computational steps. Its structure—codified as 
a graph that describes the entanglement prop- 
erties of the state—determines which logic gates 
can be implemented with local measurements. 
For example, a cluster state corresponding to 
multiple disjoint one-dimensional (1D) graphs 
(Fig. 1A) can be used to implement multistep 
computations, with teleportation along the 
length of the cluster state being analogous to 
time evolution along a single circuit wire. How- 
ever, the absence of connections between wires 
makes this state incapable of generating en- 
tanglement between inputs on separate wires, 
and hence, this resource is insufficient for uni- 
versal quantum computing. A 2D square lat- 
tice graph (Fig. 1B) possesses connectivity in 
two directions, allowing for time evolution 
through teleportation in one direction and the 
multi-input unitary evolution (that generates 
entanglement) between inputs through con- 
nections in the other direction. It enables uni- 
versal quantum computing by means of local 
measurements and hence is a universal re- 
source state for MBQC. 

Finding a universal resource state that is 
experimentally feasible (so that it can be gen- 
erated on a large scale) is therefore of para- 
mount importance to the development of a 
universal measurement-based quantum com- 
puter. Cluster states based on bosonic modes— 
known as continuous-variable cluster states 
(8)—can be generated deterministically by 
using compact quantum optics experiments 
(9). Despite several theoretical proposals for 
generating universal continuous-variable clus- 
ter states [for example, (9-12)], until now the 
current state-of-the-art experimental demon- 
strations have been limited to either small- 
scale (few-system) cluster states (13) or to the 
generation of large-scale 1D cluster states, 
which are insufficient for multi-input MBQC 
(14-16). Generation of even moderately sized 
2D cluster states by using discrete variables
(qubits) has also never been reported. 

In this work, we report the generation of 
a large-scale universal continuous-variable 
cluster state. The scalability of our experi- 
mental method stems from using a time- 
domain multiplexing (TDM) architecture for 
continuous-variable (CV) optical systems (9). 
Optical CV quantum information is encoded 
within continuous-valued quadratures $\hat{x}_k$ and
$\hat{p}_k$ that satisfy $[\hat{x}_k, \hat{p}_{k'}] = i\hbar\delta_{kk'}$ and correspond
to the complex electric field amplitudes of op- 
tical modes residing in temporally localized 
wave packets. An unlimited number of modes 
can be continuously and deterministically pre- 
pared from a few sources and then transformed 
into a continuous-variable cluster state through 
the repeated use of a circuit made of only a 
small number of optical components. Entan- 
glement in the resulting state can be observed 
from correlations in quadrature values of dif- 
ferent temporal modes. In this way, we gen- 
erate and verify a universal cluster state with 
a 5- by 1240-site 2D square lattice structure, 
with the possibility of further extension along 
both lattice dimensions by several orders of 
magnitude in the near future by use of current 
technology. 

Our method generates a continuous-variable 
cluster state from four squeezed light sources 
and a linear optical network consisting of five 
beam splitters and two delay lines (Fig. 1C). 
After single-mode squeezed states leave each 
optical parametric oscillator (OPO), they are 
converted to two-mode squeezed states by 
beam splitter 1 and beam splitter 2 and then a 
four-mode entangled state by beam splitter 3. 
Next, the delay lines multiplex these in time so 
that they form a 2D (but still disconnected) 
grid layout with cylindrical boundary condi- 
tions. Last, beam splitter 4 and beam splitter 5 
produce additional connections, resulting in 
a continuous cylindrical structure (Fig. 1D) 
[as shown for the case in which N = 30]. A 
more detailed description of this generation 
procedure can be found in (17), section SM2. 
The surface of this cylinder consists of sites 
(macronodes) that are arranged in a 2D square 
lattice whose temporal modes (micronodes) 
are connected in a nontrivial way (Fig. 1E). 
The delay between temporal modes arriving 
at each detector is set by the shorter delay Δt.
The longer delay line NΔt controls the circumference
length (N sites) of the cylinder.
Therefore, our method can generate arbitrarily
long 2D cluster states, and only the length
of the long delay line needs to be increased to
extend the circumferential dimension. One
limitation on N is that NΔt must be below the
coherence time of the light source.
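
As a rough illustration of the cylindrical layout, the sketch below maps a temporal mode index k to a (turn, site) position on a helix with N sites per turn, and lists the indices that the two delay lines bring together. It assumes the short delay pairs index k with k + 1 and the long delay pairs k with k + N, which is a simplification of the full micronode graph; the function names are hypothetical.

```python
def macronode_position(k, n):
    """Return (turn, site) of temporal index k on a helix with n sites per turn."""
    turn, site = divmod(k, n)
    return turn, site


def delay_linked_indices(k, n):
    """Temporal indices that overlap with index k after each delay line.

    Assumption: a delay of one time bin shifts k to k + 1, and a delay
    of n time bins shifts k to k + n.
    """
    return {"short_delay": k + 1, "long_delay": k + n}


# Example with the experimental values quoted in the text: N = 5, Δt = 40 ns.
N = 5
DT_NS = 40
print(macronode_position(7, N))    # (1, 2)
print(delay_linked_indices(7, N))  # {'short_delay': 8, 'long_delay': 12}
print(N * DT_NS)                   # 200, the long delay NΔt in ns
```

With this indexing, extending the circumference only changes `n`, which mirrors the statement that only the long delay line needs to be lengthened.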

Relative to the 1D case (14, 15), generating a
2D cluster state requires a more complex network 
of beam splitters. This is further complicated 
by stabilization issues and additional optical 
losses that arise from requiring the longer
delay line NΔt and adjustment issues that
arise from the constraint that the length ratio
of the two optical delay lines must be an integer.
The former can be addressed by instead
shortening the length of each temporal-mode
wave packet, which allows us to make the
cluster state using delay lines with shorter
lengths. We achieved this by developing broadband
squeezed light sources, broadband homodyne
detectors, and control systems (18). With
this, we reduced the width of the wave packet
from ~160 ns in (14, 15) to Δt = 40 ns. Regarding
the latter issue, the current setup was designed




Fig. 1. MBQC and 2D cluster state. (A and B) Abstract illustration of MBQC. 
(A) One-input MBQC by using 1D cluster state. (B) Universal multi-input 
MBQC using 2D cluster state. Each colored circle represents a mode, and each 
link represents quantum entanglement. (C) Schematic of our experimental 
setup for the 2D cluster state. OPO, optical parametric oscillator; BS, beam 
splitter; ODL, optical delay line; Δt, time interval between adjacent wave
packets; N, integer corresponding to number of inputs that 2D cluster states 
can take in computation; HD, homodyne detector. All beam splitters are 
50:50. (D and E) 2D cluster state. (D) Example for the case in which N = 30. 


Asavanant et al., Science 366, 373-376 (2019) 


so that the lengths of two optical delay lines can 
be easily measured and adjusted (17) while
keeping the number of the optical compo- 
nents the same or less than that in the pre- 
vious proposals (9, 10). We picked N = 5 for 
experimental demonstration. 

To implement universal CV quantum com- 
putation, we require the ability to perform 
arbitrary multimode Gaussian operations and 
at least one non-Gaussian operation (19). Al- 
though it is possible to map the structure of 
our 2D square lattice cluster state to the more 
standard square lattice [whose methodology 
for implementing MBQC is known (8)], such a 


oe ' 
A . A 


mapping introduces excessive noise (20). We 
have noticed that a similar experiment of generating
a 2D cluster state has recently been reported
(21), although the methodology to avoid
such a mapping has not been developed for 
their cluster state yet. Here, we describe a more 
efficient method to use our cluster state without 
such mapping. The structure of multimode 
Gaussian quantum circuits that can be imple- 
mented with our resource state by means of 
local homodyne measurements is shown in 
Fig. 2A. The number of input modes N is equal 
to the circumference length, and each gate 
is implemented by means of teleportation 


[Fig. 1 panel labels: Squeezed lights; Square-shaped cluster states; 2-dimensional cluster state; Macronode]

(E) Zoom in of the state. The representations of states make use of the
simplified graphical calculus (9). Each small node (small colored sphere) of
the graph, which we call a micronode, represents a localized wave packet
at each temporal index. Four micronodes at each temporal index k can
be grouped into a single site (large gray sphere), called a macronode. The
links and their colors represent how micronodes are entangled. The 2D cluster
state has a helical graph structure, with N macronodes on every single turn
of the helix. For actual experimental demonstration, we use N = 5. Full
descriptions are given in (17).
along the length of the cylindrical cluster
state. Further details for how to implement 
multimode Gaussian operations are given in 
(17), section SM6. Non-Gaussian operations 
can be implemented by replacing homodyne-C 
or homodyne-D with cubic-phase ancilla-assisted
measurement (10). When implementing
an encoded qubit-level computation by
means of the Gottesman-Kitaev-Preskill (GKP) 
error correction scheme for CV cluster states 
(10, 22-24), the only non-Gaussian resource 
required for both universal and fault-tolerant 
quantum computing is GKP logical |0) states, 
which can be inserted into the cluster state at 
regular intervals (25), and no measurements other 
than homodyne measurement are required. 
An n-mode Gaussian pure state $|\psi\rangle$ can be efficiently
characterized by a list of n independent
linear nullifiers, which are linear combinations
of the quadrature operators that have $|\psi\rangle$ as
their mutual zero-eigenspace. Nullifiers also
play an important role in verifying genuine 


Fig. 2. Quantum computation with our cluster state. (A) Equivalent 
quantum circuit that is implemented when our state is used. We show the 
case for five inputs in which 40 light modes (10 temporal mode indices) of 
2D cluster states are used. The number in each box is the index of the 
measured temporal modes. (B and C) The circuit is composed of multiple 
multipartite inseparability for experimentally
generated states that can be generated from 
two-mode squeezed states by means of a se- 
quence of beam splitters (10). Such states are 
approximately nullified by linear combina- 
tions of quadratures that are either all of 
position- or momentum-type. By measuring 
these operators, if the states are sufficiently 
highly squeezed, then genuine multipartite 
inseparability can be verified with the van 
Loock-Furusawa criterion. 
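
To make the idea of an approximately nullified state concrete, the following sketch propagates the covariance matrix of two oppositely squeezed vacua through a 50:50 beam splitter and evaluates the variance of the two-mode nullifiers x_A - x_B and p_A + p_B relative to shot noise. It is an illustrative textbook calculation, not the authors' analysis code: the squeezing parameter r, the quadrature ordering (x1, p1, x2, p2), the vacuum variance of 1, and all function names are assumptions. It does not implement the van Loock-Furusawa criterion itself, only the nullifier variances that enter it.

```python
import math


def matmul(a, b):
    """Plain-Python matrix product of two nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(row) for row in zip(*a)]


def quad_form(w, sigma):
    """Variance of the linear combination w . q for covariance sigma."""
    n = len(w)
    return sum(w[i] * sigma[i][j] * w[j] for i in range(n) for j in range(n))


def two_mode_squeezed_covariance(r):
    """Covariance after a 50:50 beam splitter, ordering (xA, pA, xB, pB).

    Input mode 1 is squeezed in p and input mode 2 is squeezed in x.
    """
    diag = [math.exp(2 * r), math.exp(-2 * r), math.exp(-2 * r), math.exp(2 * r)]
    sigma_in = [[diag[i] if i == j else 0.0 for j in range(4)] for i in range(4)]
    s = 1 / math.sqrt(2)
    bs = [[s, 0, s, 0], [0, s, 0, s], [s, 0, -s, 0], [0, s, 0, -s]]
    return matmul(matmul(bs, sigma_in), transpose(bs))


def nullifier_db(w, sigma):
    """Nullifier variance in dB relative to the same combination on vacuum."""
    vacuum = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
    return 10 * math.log10(quad_form(w, sigma) / quad_form(w, vacuum))


sigma = two_mode_squeezed_covariance(0.5)
x_nullifier = [1, 0, -1, 0]   # x_A - x_B
p_nullifier = [0, 1, 0, 1]    # p_A + p_B
print(round(nullifier_db(x_nullifier, sigma), 2))  # -4.34
print(round(nullifier_db(p_nullifier, sigma), 2))  # -4.34
```

For finite r both nullifier variances are reduced by the factor exp(-2r) but never reach zero, which is why the verification relies on a variance threshold rather than on exact nullification.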

Our state can be characterized in this way 
by measuring 
where $\hat{x}^{j}_{k}$ and $\hat{p}^{j}_{k}$ are quadrature operators at
temporal mode index k and at spatial index j.
Shown in Fig. 3 are the quadrature values and 
quantum correlations of quadratures corre- 
sponding to each type of nullifier of the first 
50 temporal mode indices. These quadrature 
values are obtained by processing the time- 
domain electrical signal from each homodyne 
detector and taking appropriate linear com- 
binations. We observed strong correlations 
between quadrature values from different temporal
modes, which is qualitative evidence
of quantum entanglement in our states. By


quantum teleporters: (B) one-mode operation and (C) two-mode operation.
Two-mode operations can be turned off by selecting the same measurement
basis for HD-A and HD-B. Classical feed-forward does not have to be implemented
immediately after homodyne measurements and can be delayed to the end
of the computation for Gaussian-only computations.
Fig. 3. Quadrature values and four types of quadrature correlations of the first
50 temporal mode indices. (A and B) Single-shot quadrature values of $\hat{x}^{j}_{k}$ and $\hat{p}^{j}_{k}$
obtained by processing time-domain signals from homodyne detector j (j = A, B, C,
D). (C to F) Correlations of quadrature values corresponding to the four types of
nullifier, respectively. Although the quadrature values measured at each
homodyne detector seemed to be just fluctuating randomly around zero,
we observed four types of strong quantum correlations between
six quadrature values with different temporal mode index k and spatial
index j. The quadrature values are plotted by using the unit at which
variance of the vacuum state is equal to 1.

18 October 2019
Fig. 4. Verification of generated cluster state for 24,960 temporal modes. (A to D) Measurement results for each of the four types of
nullifier, respectively. Black points indicate measured variances of shot noise, which are used as reference levels. Blue points indicate measured variances of
nullifiers. Purple regions indicate regions where the variances of the nullifiers are below -4.5 dB compared with shot noise, which indicate entanglement. The variances
of four types of nullifiers satisfied quantum inseparability criteria up to k = 6240 corresponding to 4 × 6240 = 24,960 temporal modes.


applying the van Loock-Furusawa criterion, 
we verified the 2D entanglement structure of 
the state if we observed that the variances of 
all the nullifiers were below -4.5 dB compared 
with shot noise (17).

The measurement results for each nullifier 
type for each temporal mode index k are shown
in Fig. 4. All nullifiers were observed to be 
below -4.5 dB for up to k = 6240. Because 
there are four micronodes at each temporal 
index k, the states we verified possess 24,960 
micronodes. Because one macronode (or one 
site) consists of four micronodes and we used 
N = 5, the structure of the state we verified is a 
5- by 1248-site 2D square lattice with cylindri- 
cal boundary conditions. The means of the 
variances for each type of nullifier are -4.82 ±
0.06 dB, -5.34 ± 0.06 dB, -4.81 ± 0.06 dB, and
-4.93 ± 0.06 dB, respectively. These values are
limited by the original squeezing level from 
the squeezed light sources, optical losses and 
fluctuations in the optical system, and elec- 
trical noises from homodyne detectors and 
are in good agreement with the experimental 
parameters. Statistical errors are the main 
contributors to the error bars, which can be 
arbitrarily decreased by increasing the num- 
ber of events used for calculating the nullifiers. 
There are no corrections for experimental im- 
perfections, and the nullifiers do not degrade 
with increasing k, suggesting that k can be
arbitrarily large. 
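
The counting argument in this paragraph can be written out as a small calculation. The sketch below takes the largest verified temporal index, the four micronodes per index, and the circumference N = 5 from the text; the function name `lattice_summary` is hypothetical.

```python
def lattice_summary(max_k, micronodes_per_index=4, circumference=5):
    """Return (micronode count, (circumference, length in sites)).

    Each temporal index k holds one macronode of four micronodes, and
    every `circumference` macronodes complete one turn of the cylinder.
    """
    micronodes = micronodes_per_index * max_k
    length_sites = max_k // circumference
    return micronodes, (circumference, length_sites)


print(lattice_summary(6240))  # (24960, (5, 1248))
```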

Thus, we have proposed and verified the 
generation of a universal resource state for 
MBQC. Using this cluster state for large-scale 
MBQC requires a few additional steps. First, 
because the delay line ratio N = 5 sets the num- 
ber of inputs in the effective quantum circuit, 
this must be increased. Second, because finite 
squeezing level sets the noise level when clus- 
ter states are used in MBQC (20), the squeez- 
ing level must be improved to be above the 
fault-tolerant threshold (22). It is possible to 
increase the delay line ratio by increasing the 
bandwidth of the squeezed light source, which 
reduces wave packet size, and development
of a low-loss optical delay line. Even with cur- 
rently available technology (26, 27), we expect 
a number of the input modes on the order of 
10* to be achievable, and if we consider the 
ultimate limit set by coherence time of the 
light source, the number of the input modes 
could be potentially increased to ~10'° modes 
by use of a narrow-linewidth laser (28). Re- 
garding improvement of the squeezing level to 
above the fault-tolerant threshold, the squeez- 
ing level of our cluster state can potentially 
reach -15 dB with the state-of-the-art squeezed 
light source (29), which begins to be within 
reach of the known thresholds (-15 to -17 dB) 
for fault tolerance in this architecture by using 
particular quantum-error-correcting codes (30). 
Moreover, it has recently been shown that it is 
possible to further relax the threshold to about 
-10 dB with analog quantum error correction 
and postselection (23). Therefore, we believe 
that demonstration of our cluster state pro- 
vides a feasible way toward realization of a 
practical quantum computer. 
Isolated cortical computations during delta waves
support memory consolidation 


Ralitsa Todorova and Michaël Zugaro*


Delta waves have been described as periods of generalized silence across the cortex, and their 
alternation with periods of endogenous activity results in the slow oscillation of slow-wave sleep. Despite 
evidence that delta waves are instrumental for memory consolidation, their specific role in reshaping 
cortical functional circuits remains puzzling. In a rat model, we found that delta waves are not periods 
of complete silence and that the residual activity is not mere neuronal noise. Instead, cortical cells 
involved in learning a spatial memory task subsequently formed cell assemblies during delta waves in 
response to transient reactivation of hippocampal ensembles during ripples. This process occurred 
selectively during endogenous or induced memory consolidation. Thus, delta waves represent isolated 
cortical computations tightly related to ongoing information processing underlying memory consolidation.


Most of our time spent asleep is dominated
by slow oscillations (0.1 to 1 Hz),
when cortical neurons synchronously 
alternate between a depolarized (up) 
state associated with high levels of 
endogenous activity and a hyperpolarized 
(down) state when neurons remain silent 
(1). Delta waves are large deflections of the 
local field potential (LFP) that correspond 
to the down states of the slow oscillation 


Fig. 1. Delta spikes. (A) Example of a nonsilent delta wave. Colored curves
indicate LFPs recorded from the medial prefrontal cortex (color: recording
channel). Colored vertical ticks indicate spikes emitted by simultaneously recorded
prefrontal units (color: channel from which the unit was recorded).
Dashed lines indicate the beginning and end of delta waves. A delta spike
(black circle) occurs during the second delta wave, when the rest of the
network remains silent. Black calibration bars: 0.5 s (horizontal); 1 mV (vertical).
mPFC, medial prefrontal cortex. (B) Mean perievent time histogram of the
normalized firing rate of prefrontal units centered on delta waves (top curve: mean field event). The
dashed white line indicates residual activity during delta waves. (C) Time
distribution of the spikes emitted by each prefrontal neuron closest to each
delta wave. The large peak at ~100 ms corresponds with activity in the up state.
The smaller peak consists of spikes occurring during delta waves. The dashed
line indicates the 15-ms upper threshold used to define delta spikes in
subsequent analyses (all results were confirmed by using ±30-ms time windows).
and are thus considered periods of generalized
cortical silence. The slow oscillation
plays a causal role in memory consolidation 
(2-5), in particular by orchestrating an in- 
formation flow between the hippocampus and 
the neocortex (6). Indeed, delta waves tend to 
occur in close temporal proximity to hippo- 
campal ripples (7), which are instrumental 
for memory consolidation (8, 9). Hippocampal
replay of awake activity (10), biased by
inputs from sensory cortices (11, 12), initiates
reactivation of prefrontal cortical cell assem- 
blies (13, 14) just before the occurrence of a 
delta wave (7). Cortical synaptic plasticity 
subsequently takes place during network 
reorganization early in the following up state 
(15, 16) and during the massive calcium entry
accompanying the ensuing sleep spindle
(17-19). This hippocampo-cortical dialogue
(20-22) is instrumental for memory consoli- 
dation (5). However, the incursion of gener- 
alized silence (delta wave) precisely between 
periods of information exchange and periods 
of network plasticity remains puzzling. 

We recorded prefrontal cortical activity in 
nine rats during slow-wave sleep (5). Con- 
sistent with previous reports, most delta waves 
were accompanied by neuronal silence. Yet, 
occasionally, spikes did occur during delta 
waves (Fig. 1A), and when considering cumu- 
lative spiking activity over all recorded delta 
waves, unexpected residual activity appeared 
at the peak of the waves (Fig. 1, B and C) (spike 
waveforms recorded during delta waves were 
not distinguishable from spike waveforms 
(D) (Left) Number of units that discharged in a given proportion of delta waves
(gray curve: log-normal fit with the same mean and variance as the data;
error bars: 95% confidence intervals from bootstrapped data). No unit fired
in 0% of the delta waves. (Right) Number of delta waves containing a
given number of delta spikes. (E) No difference in average waveforms between
silent (black: n = 101,161) and nonsilent (blue: n = 12,205) delta waves (Monte
recorded outside delta waves; fig. S1). On
closer examination, neuronal activity occurred 
consistently in a substantial fraction of delta 
waves (12%), wherein one or a few neurons 
remained active while the rest of the popu- 
lation became silent (Fig. 1D). We call this 
unexpected persisting activity “delta spikes.” 
To investigate whether delta spikes were re- 
stricted to a particular subset of neurons, we 
counted the number of delta waves in which 
each unit emitted one or more spikes. As it 
happened, every single recorded unit fired 
during delta waves, suggesting instead that 
persisting firing may actually constitute a 
widespread phenomenon (Fig. 1D and fig. S2). 
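
The per-unit count described above can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the function names, the dictionary layout of the spike data, and the representation of each delta wave as a (start, end) interval are all assumptions made for the example.

```python
from bisect import bisect_left, bisect_right


def delta_participation(spike_times, delta_waves):
    """Fraction of delta waves in which each unit fired at least once.

    spike_times: dict mapping unit id -> sorted list of spike times (s)
    delta_waves: list of (start, end) intervals (s), one per delta wave
    (both layouts are hypothetical, chosen for this sketch)
    """
    fractions = {}
    for unit, spikes in spike_times.items():
        hits = 0
        for start, end in delta_waves:
            # Spikes with start <= t <= end lie between these two indices.
            if bisect_right(spikes, end) > bisect_left(spikes, start):
                hits += 1
        fractions[unit] = hits / len(delta_waves)
    return fractions


def nonsilent_fraction(spike_times, delta_waves):
    """Fraction of delta waves containing a spike from any recorded unit."""
    nonsilent = 0
    for start, end in delta_waves:
        if any(bisect_right(s, end) > bisect_left(s, start)
               for s in spike_times.values()):
            nonsilent += 1
    return nonsilent / len(delta_waves)


# Made-up toy data: two units, four delta waves.
toy_spikes = {"u1": [0.05, 1.2, 3.1], "u2": [2.05]}
toy_waves = [(0.0, 0.1), (1.0, 1.1), (2.0, 2.1), (3.0, 3.2)]
print(delta_participation(toy_spikes, toy_waves))
print(nonsilent_fraction(toy_spikes, toy_waves))
```

A unit with a nonzero entry in the first result fired in at least one delta wave; the second function gives the share of delta waves that were not silent across the recorded population.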

We then wondered whether delta spikes 
tended to occur in specific delta waves with 
distinctive characteristics. We thus compared 
delta waves in which we did or did not detect 
cortical spikes and found no significant dif- 
ference between the two groups in terms of 
waveform (Fig. 1E), duration, timing (fig. S3), 
depth (fig. S4), decreased gamma power, or 
coupling with hippocampal ripples and tha- 
lamocortical spindles (fig. S5). This suggests 
that spikes could take place during virtually 
all delta waves but may remain undetected 
given the limited number of recorded neu- 
rons relative to the entire population (fig. S6). 
We thus hypothesize that firing during delta 
waves might be an overlooked phenomenon 
manifested in possibly all delta waves. 

These findings indicate that during any 
given delta wave, the cortical network becomes 
silent except for a small but ever-changing 
minority of cells. The most parsimonious 
explanation would be that delta spikes consti- 
tute random activity reflecting imperfect co- 
ordination in the cortical alternation between 
the up and down states. Yet, an alternative 
possibility is that this activity serves a well- 
defined computational function. A hallmark 
of cortical computation is the emergence of cell 
assemblies. We thus tested for the presence 
of recurring coactive cell ensembles, using 
two complementary approaches. As a first 
approach, we performed a standard indepen- 
dent component analysis (23), which identified 
multiple significant components that were 
active during delta waves (fig. S7, A and B). 
However, these components were likely to 

and preceding hippocampal activity. Blue traces indicate LFPs from the 
medial prefrontal cortex. Colored ticks indicate simultaneously recorded 
prefrontal (blue) and hippocampal (red) spikes. Black circles indicate 
delta spikes emitted within ±15 ms of the delta wave peak (shaded area). 
In the first two delta waves, delta spikes were recorded from the same 
unit after similar hippocampal activity patterns. HPC, hippocampus. 
(B) Cross-correlations (curves and shaded areas, mean ± SEM; orange, 
data; gray, time-shifted control) between hippocampal ripple activity 
(sliding window) and delta spikes (fixed, 0 s). The horizontal orange line 
indicates the Monte Carlo test: P < 0.05. corr., correlation. (C) Enrichment 
in positive correlations [comparative distribution between data and control in 
(B)] when hippocampal activity was correlated with subsequent prefrontal 

during delta waves on the basis of preceding hippocampal ripple activity 
(200-ms window). Delta spikes as well as delta components were significantly 
predicted by multiple single-unit hippocampal activity (P = 0.0403 for 
delta spikes, P = 0.0052 for delta components; Wilcoxon rank sum test) 
but not by global hippocampal drive ignoring cell identity (summed 
activity, P = 0.2597 for delta spikes and P = 0.3258 for delta components; 
Wilcoxon rank sum test). **P < 0.01; ***P < 0.001 (Wilcoxon signed-rank 
tests). (E) Object responsivity index for partner (green) versus other 
(gray) prefrontal units (curves: cumulative distributions; inset: mean ± SEM). 
Only partner prefrontal units showed positive object responsivity (partner 
units, P = 0.0162; other units, P = 0.5967; Wilcoxon signed-rank test; partner 
versus other units, P = 0.0465; Wilcoxon rank sum test). *P < 0.05. 

combine multiple smaller but overlapping cell 
ensembles, given the limited number of neu- 
rons active in any given delta wave. We thus 
performed a second analysis to examine coop- 
erative activity (“peer prediction”) (24) among 
delta spikes, an idiosyncratic property of cell 
assemblies. This showed that the delta spikes 
of one neuron could be predicted from the 
delta spikes of other neurons (fig. S7C). 


We then asked whether delta spikes were 
involved in the hippocampo-cortical dialogue 
underlying memory consolidation. Because 
delta waves typically take place precisely be- 
tween hippocampal replay and cortical reor- 
ganization for memory consolidation, this 
hypothesis would be expected to have two 
implications: (i) Hippocampal activity during 
ripples should predict which neurons (or which 
assemblies) are active during the following 
delta wave, and (ii) this predictive bias should 
emerge after behavior, and predictable cortical 
cells should be involved in the reactivation of 
waking experience. 

Rats were trained on a spatial memory task, 
and hippocampal and cortical activity was 
recorded during both behavior and subsequent 
memory consolidation during the first 2 hours 

Fig. 3. Delta waves isolate cortical computations. (A) Cross-correlations 
(curves and shaded areas, mean ± SEM) between hippocampal ripple 
activity (fixed, 0 s) and prefrontal activity (sliding wi 
cross-correlations (orange) were significantly different from a time-shifted 
control (gray) for cortical activity after ripples (horizontal orange line: 


Monte Carlo test, P < 0.05). Delta waves (dashed lin 
130 ms after ripples) tend to occur within the critical 
activity remains correlated with the preceding ripple 


neous recording of prefrontal and hippocampal activi 
(gray-shaded rectangle). (Top) Proportion of prefrontal spikes predicted by 
the firing of hippocampal cells (partner spikes). (Center) Raster plot of spikes 
emitted by 68 simultaneously recorded prefrontal units (red ticks: partner 
spikes; gray ticks: other spikes). (Bottom) Simultaneously recorded LFPs in 
the mPFC (blue: delta wave) and hippocampus (broadband and ripple-band 
filtered signal; blue: ripples). During delta waves, partner spikes occurred 
in isolation (red ellipse). Partner spikes emitted by the same units outside 
delta waves (gray ellipses) formed a considerably smaller proportion of the 
ongoing cortical activity. Black calibration bar: 0.5 s. Filt, filtered. 
(C) Signal-to-noise ratio (curves and shaded areas, mean ± SEM) of partner spikes 
relative to other spikes around delta waves. Observed values (blue) were 
significantly different from a time-shifted control (gray) during delta waves 
(horizontal blue line: Monte Carlo test, P < 0.05). 

of sleep (5). Hippocampal spiking activity 
during ripples was significantly correlated 
with cortical delta spikes that occurred 
immediately (50 to 200 ms) afterward (the effect 
was not due to data recorded from any single 
rat) (Fig. 2, A and B, and fig. S8). This 
increased correlation was due to a large 
proportion of positively correlated interregional pairs 
of neurons (Fig. 2C). Furthermore, ripple spikes 
were better correlated with delta spikes than 
with spikes occurring at similar delays during 
an up state (correlations were greater in the 
presence of a delta wave) (fig. S9). A gen- 
eralized linear model (GLM) analysis showed 
that ensemble activity in the hippocampus 

Fig. 4. Induction of memory consolidation by isolation of partner spikes. 
(A) (Left) Experimental protocol. Delta waves were induced by brief 
single-pulse electrical stimulation of deep cortical layers. Induced delta waves 
were triggered to isolate either partner activity (coupled stimulation; 
green, 130 ms after ripples) or other cortical activity (delayed stimulation; 
purple, 290 to 370 ms after ripples) during sleep after limited training 
on a spatial object-recognition task. (Right) Object discrimination index 
during the recall phase. Only delta waves triggered to isolate partner activity 
(coupled stimulation) resulted in memory consolidation and enhanced task 
performance. (B) Performance of a GLM trained to predict delta spikes on the 
basis of preceding hippocampal activity (200-ms window), measured as 
percent improvement relative to a shuffled control (prediction gain). Only 
delta waves triggered to isolate partner activity resulted in a significant 
prediction of delta spikes (P = 0.0030 for isolation of partner spikes by coupled 
stimulation; P = 0.1301 for isolation of other spikes by delayed stimulation; 
Wilcoxon rank sum test). stim, stimulation. **P < 0.01. (C) Cross-correlation 
(curves and shaded areas, mean ± SEM) of hippocampal activity and delta 
spikes (green: isolation of partner spikes by coupled stimulation; purple: 
isolation of other spikes by delayed stimulation; gray: time-shifted control; 
horizontal green line: Monte Carlo test, P < 0.05). (D) Enrichment in positive 
correlations upon isolation of partner spikes by coupled stimulation (top) 
but not of other spikes by delayed stimulation (bottom). 

could significantly predict which cortical cells 
would emit delta spikes (Fig. 2D). In contrast, 
delta spikes could not be predicted from the 
combined activity of all hippocampal units 
that ignored cell identity (multiunit activity), 
ruling out the possibility that delta spikes 
merely reflect the overall level of hippocam- 
pal excitatory drive during ripples. Finally, 
ripples facilitated (but did not entirely control) 
the formation of delta assemblies (fig. S7). 
Furthermore, the same GLM analysis applied 
to hippocampal and cortical ensembles showed 
that hippocampal activity could even predict 
delta components (Fig. 2D). 

Our second prediction concerned the rela- 
tion of this predictive bias to behavior. In sleep 
sessions preceding the task, hippocampal spik- 
ing activity during ripples failed to predict sub- 
sequent delta spikes or assemblies (fig. S10), 
indicating that the predictive bias emerged 
after task performance. We then investigated 
the behavioral correlates of the prefrontal units 
whose delta spikes were significantly predicted 
by hippocampal ripple activity during sleep 
after behavior (“partner cells”) (fig. S11 and 
tables S1 and S2). These cortical cells displayed 
higher levels of task-relevant firing during 
behavior (Fig. 2E) (we failed to find a simi- 
lar effect for delta components, possibly be- 
cause of low statistical power due to their 
limited number: n = 14 components, n = 9 
predicted). We further investigated the be- 
havioral correlates of delta assemblies and 
found that these assemblies were also ex- 
pressed during task performance but not 
outside delta waves nor in sleep that preceded 
behavior (fig. S12). 

These observations suggest that in addition 
to triggering the reorganization of cortical 
subnetworks during the transition to the 
up state (5), an unsuspected role of the delta 
wave may be to isolate from interference spe- 
cific cortical computations taking place in 
response to hippocampal replay. Consistent 
with this idea, delta waves typically occurred 
within a critical time window when cortical 
activity remained correlated with hippocam- 
pal ripple activity (Fig. 3A). To test whether 
delta waves tended to preferentially silence 
cortical activity that was related to the on- 
going hippocampo-cortical dialogue, we clas- 
sified individual prefrontal spikes as “partner 
spikes” if they followed spikes emitted by 
their significantly correlated hippocampal 
units or “other spikes” if they were unrelated 
to the preceding hippocampal activity (Fig. 
3B) (more examples are shown in fig. S13). 
The signal-to-noise ratio for partner spikes 
peaked during delta waves (Fig. 3C), and this 
was due to the selective silencing of non- 
partner activity during delta waves (fig. S14). 

Does this isolation of cortical computations 
play a critical role in memory consolidation? 
A prediction of this hypothesis is that isolat- 
ing cortical assemblies by experimental induc- 
tion of delta waves should trigger memory 
consolidation, but only if the isolated activity 
is relevant to the hippocampo-cortical dialogue 
(partner spikes). We have already shown that 
triggering delta waves when endogenous mech- 
anisms fail to do so can boost memory con- 
solidation, provided that the delta waves are 
induced in an appropriate time window (Fig. 
4A) (5). We thus sought to confirm the pre- 
diction that these delta waves actually isolated 
partner spikes (that delta spikes did occur 
during induced delta waves and that they 
were predicted by hippocampal activity). Sim- 
ilar to our observations in natural sleep (above), 
stimulation-induced delta waves did feature 
spiking activity (fig. S15), and these delta 
spikes were predicted by preceding hippo- 
campal activity coinciding with the timing 
of ripples (Fig. 4, B to D). In contrast, slightly 
delaying the induction of delta waves (by 
~200 ms) (Fig. 3A) to isolate nonpartner 
delta spikes (Fig. 4, B to D) failed to induce 
memory consolidation (5). 

Our results challenge the generally accepted 
tenet that delta waves, reflecting the down 
states of the sleep slow oscillation, are periods 
of complete cortical silence (1, 6, 25), to the 
point that they have sometimes been defined 
as such (26, 27) and that occasional spikes 
have been routinely ignored when detected 
(28, 29). We focused on delta spikes and 
found that they are not neuronal noise due 
to imperfect silencing of the cortical man- 
tle. On the contrary, they constitute a com- 
mon phenomenon potentially implicating 
all neurons and all delta waves, and they re- 
flect genuine processing involved in memory 
consolidation. 

This observation also provides a mechanism 
for the documented but puzzling role of delta 
waves in memory consolidation: Synchron- 
ized silence across most of the cortex isolates 
the network from competing inputs while 
a select subpopulation of neurons maintains 
relevant spike patterns between epochs of 
hippocampo-cortical information transfer 
(10, 12, 14) and epochs of cortical plasticity 
(15, 16) and network reorganization (5, 18, 19). 
Yet, in many cases, cortical activity during 
delta waves could not be reliably predicted 
from the preceding hippocampal ripple activ- 
ity. Such cortical activity could instead have 
been related to interactions with other brain 
networks. This suggests that delta spikes and 
assemblies might constitute a general mech- 
anism of isolated cortical computation beyond 
the hippocampo-cortical dialogue. 

Synthetic dissipation and cascade fluxes in a turbulent quantum gas 


Nir Navon, Christoph Eigen, Jinyi Zhang, Raphael Lopes, Alexander L. Gaunt, 
Kazuya Fujimoto, Makoto Tsubota, Robert P. Smith, Zoran Hadzibabic 


Scale-invariant fluxes are the defining property of turbulent cascades, but their direct 
measurement is a challenging experimental problem. Here we perform such a measurement 
for a direct energy cascade in a turbulent quantum gas. Using a time-periodic force, we inject 
energy at a large length scale and generate a cascade in a uniformly trapped three-dimensional 
Bose gas. The adjustable trap depth provides a high-momentum cutoff $k_D$, which realizes a synthetic 
dissipation scale. This gives us direct access to the particle flux across a momentum shell of radius 
$k_D$, and the tunability of $k_D$ allows for a clear demonstration of the zeroth law of turbulence. 
Moreover, our time-resolved measurements give unique access to the pre-steady-state dynamics, 
when the cascade front propagates in momentum space. 


The discovery of a universal law describing the transfer of energy from large to 
small length scales in turbulent flows was a conceptual breakthrough (1, 2). Despite 
their complex spatiotemporal dynamics, turbulent flows often obey a simple generic 
picture: The energy injected into the system at a large length scale is gradually 
transferred to ever smaller ones, flowing locally in Fourier space through the 
so-called inertial range where no dissipation occurs, until it is dissipated at 
some small length scale. In Fig. 1A, we depict such turbulent-cascade dynamics for 
a compressible field in real space. The field is initially at rest. At times 
$t > 0$, an external force creates excitations at a large length scale $1/k_F$. 
These excitations propagate to smaller length scales owing to their nonlinear 
interactions. Once they first reach the dissipation scale $1/k_D$, at time $t_d$, 
the field fluctuates on all length scales from $1/k_F$ to $1/k_D$. If a steady 
state is established within the momentum range $k_F$ to $k_D$, from thereon energy 
is dissipated at $k_D$ at the same rate at which it is injected at $k_F$. In such a 
steady state, the momentum-space distributions of quantities such as the energy or 
wave amplitude are generically scale-free power laws. 
Many quantitative theoretical predictions about turbulence are based on taking the 
mathematical limits $k_F \to 0$ and $k_D \to \infty$ (3). Such formal treatments 
lead to predictions that are elegant but often counterintuitive. A key prediction 
of this kind is that for $k_D \to \infty$, the steady-state cascade corresponds to 
a scale-invariant ($k$-independent) energy flux through momentum space, but no 
particle flux (4). 

Experimentally, the steady-state power-law spectra of various quantities have been 
extensively studied (5-9), whereas the equally fundamental cascade fluxes are harder 
to measure (10-13). Recently, ultracold atomic gases have emerged as a versatile 
platform for studies of turbulence (9, 14-22), offering experimental possibilities 
unavailable in other systems. Here, we use an atomic gas to directly measure cascade 
fluxes in a turbulent system. Our dissipation scale is tuneable, which allows us to 
explore how the fluxes depend on $k_D$, and to reconcile the experimental 
observations with the formal predictions for $k_D \to \infty$. Our system also 
allows a time-resolved study of the initial stage of turbulence (23-25), when a 
steady state is not yet established, which reveals how the cascade front propagates 
in momentum space. 

Our experiment starts with a weakly interacting Bose-Einstein condensate of 
$N \approx 1.2 \times 10^5$ atoms of $^{87}$Rb in the uniform potential of a 
cylindrical optical box trap of radius $R \approx 16$ µm and length $L = 27$ µm 
(Fig. 1B) (26). At the end of the initial preparation of the gas, the noncondensed 
fraction is <10% and the chemical potential is $\mu \approx k_B \times 2$ nK, 
corresponding to a healing length $\xi \approx 1.2$ µm $\ll R, L$. We initiate a 
turbulent cascade by injecting energy at the system-size length scale 
(corresponding to a small momentum $k_F$), using a spatially uniform force 
$\mathbf{F}_s(\mathbf{r}, t) = F_0 \sin(\omega_s t)\,\hat{\mathbf{x}}$, where 
$\hat{\mathbf{x}}$ is a unit vector along the box symmetry axis, 
$F_0 L = k_B \times 2.5$ nK, and $\omega_s = 2\pi \times 9$ Hz is tuned to 
resonantly excite the sound wave of wavelength $2L$ (so $k_F = \pi/L$) (27). This 
anisotropic forcing of the matter-wave field is represented in Fig. 1C as a small 
dark blue area elongated along $k_x$. As shown in (9), after several seconds of 
shaking, the momentum distribution of the gas in the inertial range is statistically 
isotropic and time-invariant, 
$n(\mathbf{k}, t) = \langle n(k) \rangle \propto k^{-\gamma}$, with $\gamma = 3.5$ 
(28, 29). The time invariance implies that the energy and particle fluxes through 
this $k$-range are $k$-independent, but it does not reveal their values. Here, we 
extract the cascade fluxes by studying the dissipation in our gas. 

In conventional fluids, one observes macroscopic (hydrodynamic) degrees of freedom, 
and the dissipation occurs in the form of heating (i.e., transfer of energy into 
the microscopic degrees of freedom). This dissipation is set by the viscosity 
$\nu$, which is generally not tuneable. Moreover, the resulting minute heating is 
often difficult to measure because of thermal coupling of the fluid with its 
surroundings (30). Our system is thermally isolated from the environment, and we 
have direct access to all the microscopic degrees of freedom, so the dissipation 
occurs only in the form of (readily measurable) particle loss. The optical box 
(Fig. 1B) has a finite energy depth $U_D$, so particles with a sufficiently large 
energy leave the box; in momentum space, $U_D$ corresponds to a sphere of radius 
$k_D = \sqrt{2 m U_D}/\hbar$ (Fig. 1C), where $m$ is the atom mass and $\hbar$ is 
Planck's constant divided by $2\pi$. This simple feature realizes a synthetic 
dissipation scale, with $U_D$ defining the particle and energy sink. Crucially, 
this dissipation scale can be tuned by changing the trapping laser power (31). 
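
To give a feel for the numbers, the relation between trap depth and dissipation wavenumber can be evaluated directly. The sketch below assumes rubidium-87 atoms and uses illustrative trap depths of 50, 100, and 200 nK; the function name and the chosen depths are assumptions for this example, not values taken from the measurements.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
K_B = 1.380649e-23       # Boltzmann constant, J/K
AMU = 1.66053906660e-27  # atomic mass unit, kg
M_RB87 = 86.909 * AMU    # assumed atom mass (rubidium-87), kg


def dissipation_wavenumber(u_d_nk, mass=M_RB87):
    """Return k_D = sqrt(2 m U_D) / hbar in 1/m, for a trap depth U_D / k_B in nK."""
    u_d = K_B * u_d_nk * 1e-9  # trap depth in joules
    return math.sqrt(2.0 * mass * u_d) / HBAR


for depth_nk in (50, 100, 200):  # illustrative depths, in nK
    k_d = dissipation_wavenumber(depth_nk)
    print(f"U_D/k_B = {depth_nk} nK -> k_D = {k_d * 1e-6:.2f} per micrometre")
```

Because $k_D \propto \sqrt{U_D}$, quadrupling the trap depth doubles the dissipation wavenumber.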

Formally, within the assumptions of the wave-turbulence theory, the equations of 
motion lead to a continuity equation, with a source and a sink, that is local in 
momentum space (3): 

$$\frac{\partial n(\mathbf{k}, t)}{\partial t} = F(\mathbf{k}, t) - D(\mathbf{k}, t) - \nabla_{\mathbf{k}} \cdot \boldsymbol{\Pi}_n(\mathbf{k}, t) \qquad (1)$$

Here $F(\mathbf{k}, t)$ corresponds to the external force, $D(\mathbf{k}, t)$ 
describes the dissipation, and $\nabla_{\mathbf{k}} \cdot \boldsymbol{\Pi}_n$ 
captures the nonlinear interactions, where $\boldsymbol{\Pi}_n$ is the particle 
flux. For $F = D = 0$, the steady-state solutions are zero-flux equilibrium 
thermodynamic states. If $F$ and $D$ are nonzero but are localized in $k$ space, 
one can also get non-equilibrium steady-state solutions with a nonzero 
scale-independent flux sustained by the source $F$ and the sink $D$. 

For an isotropic outflow, the total radial particle flux is 
$\Pi_n(k) = 4\pi k^2 |\boldsymbol{\Pi}_n(k)|$. Hence, from Eq. 1, in the inertial 
range $4\pi k^2\, \partial n/\partial t = -\partial \Pi_n/\partial k$. Integrating 
over $k$ shows that we can measure the particle flux through the shell at $k_D$ by 
simply counting the atoms remaining in the trap (see Fig. 1C): 

$$\frac{\partial N}{\partial t} = -\Pi_n(k_D, t) \qquad (2)$$

For a (non-equilibrium) steady state, with time-invariant $n(k)$ in the inertial 
range (9), the particle flux is $k$- and $t$-independent (32), so 
$\Pi_n(k_D, t) = \Pi_n(k, t) = \Pi_n$. 

In steady state, the total radial energy flux, $\Pi_\varepsilon(k, t)$, is also 
$k$- and $t$-independent in the inertial range, and is equal to the rate of energy 
dissipation. To relate it to $\Pi_n$, we consider the pertinent case of weakly 
interacting particles with a dispersion relation $\omega(k)$, so the energy 
spectrum is $E(k, t) = \hbar\omega(k)\, n(k, t)$; in our case, 
$\omega(k) \propto k^2$. At $k < k_D$, microscopic interactions drive particles to 
both lower and higher $k$, so the relationship between the net energy and particle 
fluxes, $\Pi_\varepsilon$ and $\Pi_n$, is nontrivial; one might naively expect that 
$\Pi_\varepsilon(k) = \hbar\omega(k)\,\Pi_n(k)$, but this cannot be true if both 
$\Pi_\varepsilon$ and $\Pi_n$ are $k$-independent and $\omega(k)$ is not. However, 
at $k_D$ the particles flow only one way because there is no "backflow" from the 
sink into the inertial range, so one can intuitively write 

$$\Pi_\varepsilon(k_D) = \hbar\omega(k_D)\, \Pi_n(k_D) \qquad (3)$$

Steady state then requires $\Pi_\varepsilon = \hbar\omega(k_D)\,\Pi_n$ at all $k$ 
in the inertial range; for our $\omega(k)$, this means that 
$\Pi_\varepsilon \propto k_D^2\,\Pi_n$. To formally derive Eq. 3, one multiplies 
Eq. 1 by $\hbar\omega(k)$ and invokes the continuity equation for the energy to 
obtain 

$$\frac{\partial \Pi_\varepsilon(k, t)}{\partial k} = \hbar\omega(k)\, \frac{\partial \Pi_n(k, t)}{\partial k} \qquad (4)$$

in the inertial range. For $k < k_D$, this equation is trivially satisfied by both 
of its sides being zero, and does not impose any relation between 
$\Pi_\varepsilon(k)$ and $\Pi_n(k)$. However, integrating it across a shell around 
$k_D$, and setting $n(k)$ and all fluxes to zero for $k > k_D$, recovers Eq. 3. 
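
The integration across the shell can be written out explicitly. The following is a sketch under one stated assumption: the shell around $k_D$ is thin enough that $\omega(k) \simeq \omega(k_D)$ across it. The half-width $\delta$ is a symbol introduced here purely for illustration.

```latex
\begin{aligned}
\int_{k_D-\delta}^{k_D+\delta} \frac{\partial \Pi_\varepsilon}{\partial k}\,dk
  &= \int_{k_D-\delta}^{k_D+\delta} \hbar\omega(k)\,\frac{\partial \Pi_n}{\partial k}\,dk \\
\Pi_\varepsilon(k_D+\delta) - \Pi_\varepsilon(k_D-\delta)
  &\simeq \hbar\omega(k_D)\,\bigl[\Pi_n(k_D+\delta) - \Pi_n(k_D-\delta)\bigr]
\end{aligned}
```

With both fluxes set to zero at $k_D + \delta$ (no atoms beyond the cutoff), the signs cancel and the second line reduces to $\Pi_\varepsilon(k_D) = \hbar\omega(k_D)\,\Pi_n(k_D)$, which is Eq. 3.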

Experimentally, we vary $k_D$ while keeping $F_0$ fixed and measure $\Pi_n(k_D)$ 
according to Eq. 2. To mitigate the effects of the long-term few-percent drifts in 
the initial $N$, and of the additional atom loss through collisions with the 
background-gas particles, we perform differential measurements of the 
cascade-induced atom loss, $N_{\rm loss}$, with reference measurements taken by 
setting $F_0$ to zero in an otherwise identical experimental sequence. 

In Fig. 2, we show $N_{\rm loss}$ as a function of the shaking time $t_s$ for 
various values of $U_D$. In all cases at short times, we observe no loss (within 
error). This is consistent with the expectations that no losses occur at 
$k < k_D \propto \sqrt{U_D}$ and that initially it takes time for the excitations 
to cascade to $k_D$, when a steady state can be established (see Fig. 1). For $t_s$ 
longer than some onset time $t_d$, the loss rate 
$\partial N_{\rm loss}/\partial t$ is essentially constant in time, as long as the 
total loss is relatively small (<30% of the initial $N = 1.2 \times 10^5$). The 
dashed lines show piecewise linear fits that we use to extract, for each $U_D$, 
both $t_d$ and the subsequent initial loss rate, which we identify with the 
steady-state particle flux $\Pi_n = \Pi_n(k_D)$. At much longer times, 
$t_s \gg t_d$, the steady-state assumption can no longer hold, because the losses 
gradually deplete the low-$k$ source of atoms. 
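
A piecewise linear fit of this kind can be sketched as below. The text does not specify the fitting procedure, so this example assumes a simple model (zero loss before the onset time, then a constant loss rate) and finds the onset time by a grid scan with a closed-form least-squares slope at each candidate. The function name, the grid resolution, and the synthetic data are all hypothetical.

```python
def fit_onset(times, losses, n_grid=2000):
    """Fit losses ~ rate * max(0, t - t_d); return (t_d, rate).

    Scans candidate onset times t_d on a uniform grid and, for each one,
    solves for the least-squares slope through the origin.
    """
    best = None
    t_min, t_max = min(times), max(times)
    for i in range(n_grid):
        t_d = t_min + (t_max - t_min) * i / n_grid
        x = [max(0.0, t - t_d) for t in times]
        sxx = sum(v * v for v in x)
        if sxx == 0.0:
            continue  # no data beyond this candidate onset time
        rate = sum(v * y for v, y in zip(x, losses)) / sxx
        sse = sum((y - rate * v) ** 2 for v, y in zip(x, losses))
        if best is None or sse < best[0]:
            best = (sse, t_d, rate)
    return best[1], best[2]


# Synthetic, noise-free example (made-up numbers): onset at 1.0 s, rate 5.0 per s.
times = [0.1 * i for i in range(31)]
losses = [5.0 * max(0.0, t - 1.0) for t in times]
onset, rate = fit_onset(times, losses)
print(onset, rate)
```

In this model the fitted slope plays the role of the steady-state particle flux and the fitted kink plays the role of the onset time $t_d$.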

In Fig. 3, we show a log-log plot of $\Pi_n$ versus $U_D$ (31). We observe 
power-law behavior with 
$\Pi_n \propto U_D^{-1.05 \pm 0.08} \propto k_D^{-2.10 \pm 0.16}$. We complement 
these measurements with numerical simulations based on the Gross-Pitaevskii 
equation, for the same forcing protocol and without any free parameters [see (27) 
for details]. The 
numerical simulation results are shown by 
solid circles; a fit to the numerical data gives 
TI, &¢ Up +°*°°! in good agreement with the 
experimental data. 
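
Extracting a power-law exponent from data plotted on log-log axes amounts to a straight-line fit in logarithmic variables. The sketch below shows that step on made-up numbers; it is not the measured data, and the function name is an assumption for this example.

```python
import math


def power_law_exponent(x, y):
    """Least-squares slope of log(y) versus log(x), i.e. b in y ~ a * x**b."""
    lx = [math.log(v) for v in x]
    ly = [math.log(v) for v in y]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    sxx = sum((a - mean_x) ** 2 for a in lx)
    return sxy / sxx


# Synthetic example (made-up numbers): a flux that scales exactly as 1 / U_D.
depths = [40.0, 60.0, 90.0, 135.0, 200.0]
fluxes = [1000.0 / u for u in depths]
b = power_law_exponent(depths, fluxes)
# Since k_D is proportional to sqrt(U_D), the exponent in k_D is twice that in U_D.
print(b, 2 * b)
```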

The so-called zeroth law of turbulence, first 
formulated in the context of classical incom- 
pressible fluids, stipulates that for fixed forcing, 
the steady-state rate of energy dissipation tends 
to a nonzero constant as the viscosity vanishes 
($\nu \to 0$) &, 33). In our case, this corresponds to 
keeping $F_0$ fixed and taking $k_D \to \infty$ (34). This 

Fig. 1. Turbulent cascade in a box-trapped quantum gas. (A) Cartoon of real-space dynamics of a 
turbulent wave. Energy is injected by applying a force at a large length scale, $1/k_F$, and propagates to smaller 
scales because of nonlinear interactions. A steady state can be established when the excitations first reach 
the small dissipation length scale, $1/k_D$, at time $t_d$. (B) Sketch of the experimental setting. The atoms 
are trapped in a finite-depth potential in the shape of a cylindrical box, formed by laser barriers. The shaking 
force is applied along the $x$ axis. (C) In momentum space, the dissipation scale $k_D$ is set by the trap depth; 
when excitations propagate to $k_D$, dissipation occurs in the form of particle loss. 


Fig. 2. Atom-loss dynamics associated with the turbulent cascade. The graph shows 
atoms lost versus shaking time $t_s$ for different trap depths $U_D$ (at $t_s = 0$, 
the atom number is $N = 1.2 \times 10^5$). Data points show averages of typically 
50 measurements. Dashed lines are piecewise linear fits. The systematic uncertainty 
in $U_D$ values is 20%. 


law implies that the particle flux should vanish as $\Pi_n \propto k_D^{-2}$ (see 
Eq. 3), in excellent agreement with our data. Note that the steady-state energy 
balance also requires that $\Pi_\varepsilon$ is equal to the rate of energy input 
into the system, $\varepsilon$. However, energy conservation alone is not 
sufficient to predict the scaling of $\Pi_n$ with $k_D$, 


Fig. 3. Steady-state particle flux. The atom-loss rate $\Pi_n$ versus the 
dissipation energy scale $U_D$ (open symbols) is shown on a log-log plot; the three 
colored points correspond to the data shown in Fig. 2. Solid symbols show the 
results of numerical simulations (27). The systematic uncertainty in $U_D$ values 
is 20%. A power-law fit to the experimental data (solid line) gives 
$\Pi_n \propto U_D^{-1.05 \pm 0.08} \propto k_D^{-2.10 \pm 0.16}$, in agreement 
with the theoretical prediction. 


Fig. 4. Establishing the steady state: The cascade-front dynamics. 
(A) Momentum-space turbulent dynamics. Forcing occurs at $k_F$ and the steady-state 
distribution $n(k)$ is established in the wake of the cascade front 
$k_{\rm cf}(t_s)$, which propagates outward until it reaches $k_D$ at time $t_d$. 
For clarity, here we show an idealized sketch with a very large separation between 
$k_F$ and $k_D$; see (27) for numerical simulations with our experimental 
parameters. Inset: Consistent picture for the evolution of the energy flux 
$\Pi_\varepsilon$ and particle flux $\Pi_n$ for three different times, $t_1$ (blue) 
< $t_2$ (purple) < $t_3$ (black), with $t_2 < t_d < t_3$. The forcing and 
dissipation scales are indicated by the vertical arrows, as in the main panel. 
(B) Onset time for dissipation. Open symbols show the measured $t_d$ values versus 
$U_D$ on a log-log plot; the three colored 


because it is not a priori obvious that for fixed $F_0$ the rate at which the 
system absorbs energy from the drive is not affected by changing $k_D$ (35). Only 
a posteriori, from Fig. 3 (and the conservation of energy), can we see that in our 
system the steady-state $\varepsilon$ must be independent of the dissipation length 
scale down to 



points correspond to the data shown in Fig. 2. Solid symbols show the results of numerical simulations (27). 
The systematic uncertainty in $U_D$ values is 20%. A power-law fit, $t_d \propto U_D^{\beta}$, to the experimental data 
(solid line) gives $\beta = 0.73 \pm 0.06$, in agreement with the prediction $\beta = 0.75 \pm 0.05$. 

our lowest $k_D$. If $k_D$ were changed dynamically, for a system to reach a new 
steady state the particle flux would have to self-consistently adjust at all 
$k_F < k < k_D$, because the steady-state $\Pi_n$ must be both $k_D$-dependent (to 
satisfy the zeroth law) and $k$-independent for a given $k_D$. 

Having established a consistent picture of the steady-state fluxes in our gas, we 
now turn to the pre-steady-state turbulent dynamics. In Fig. 4A, we depict the 
early-time dynamics in Fourier space. The forcing, which generates a surplus of 
atoms at $k_F$, initiates the cascade at $t_s = 0$. As the cascade front, 
$k_{\rm cf}(t_s)$, propagates to higher $k$, the steady-state momentum 
distribution, $n(k) \propto k^{-\gamma}$, is established in its wake [see also 
(27)]. The dynamics are dissipationless until $k_{\rm cf}$ reaches $k_D$ (at time 
$t_d$); only then is a steady state, with matching $\varepsilon$ and 
$\Pi_\varepsilon(k_D)$, established. Hence, our experimental observations of the 
initial dissipationless stage of turbulence ($t_s < t_d$), and the dependence of 
$t_d$ on $U_D$, give us access to the dynamics of the cascade front in momentum 
space. 

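At $t_s < t_d$, the instantaneous particle flux is $k$-independent for 
$k < k_{\rm cf}(t_s)$, vanishes for $k > k_{\rm cf}(t_s)$, and must match the rate 
of the population increase in the inertial range: 
$n(k_{\rm cf})\, 4\pi k_{\rm cf}^2\, dk_{\rm cf} = \Pi_n(k_{\rm cf})\, dt_s$, so 
$k_{\rm cf}^{2-\gamma}\, dk_{\rm cf} \propto \Pi_n(k_{\rm cf})\, dt_s$. 
Analogously, for the increase of total energy in the inertial range, 
$k_{\rm cf}^{4-\gamma}\, dk_{\rm cf} \propto \Pi_\varepsilon(k_{\rm cf})\, dt_s$, 
and $\Pi_\varepsilon(k_{\rm cf})$ is equal to the instantaneous energy injection 
rate $\varepsilon$. 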

Assuming that $\epsilon$, which we found not to
depend on $k_D$ in steady state, is also independent
of $k_{\mathrm{cf}}$ at $t_s < t_d$, then the instantaneous
$\Pi_n(t_s)$, at $k < k_{\mathrm{cf}}(t_s)$, is $\propto k_{\mathrm{cf}}^{-2}$. This gives an
elegant unifying picture of the particle fluxes
for $t_s < t_d$ and $t_s > t_d$ [Fig. 4A, inset, and (27)]:
$\Pi_n$ is always the same function of the highest
$k$ for which the steady-state $n(k)$ has been
established (i.e., the lowest $k$ from which there
is no backflow), whether that is the instantaneous
$k_{\mathrm{cf}} < k_D$ (for $t_s < t_d$) or $k_D$. This self-consistent
picture also leads to a quantitative
prediction that is verifiable in our experiments:
The time independence of $\epsilon$ implies
$k_{\mathrm{cf}}^{4-\gamma}\,\mathrm{d}k_{\mathrm{cf}} \propto \mathrm{d}t_s$,
which for $\gamma < 5$ and $k_D \gg k_F$ gives a power-law
prediction $t_d \propto U_D^{\beta}$, with $\beta = (5 - \gamma)/2$.
Specifically, for our $\gamma = 3.5 \pm 0.1$ (9), we predict
$\beta = 0.75 \pm 0.05$.
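
The step from the cascade-front growth law to the exponent can be written out explicitly. The following is a sketch, not the authors' own derivation: it assumes a free-particle relation between the dissipation energy scale and the dissipation wave number, $U_D \propto k_D^2$, which is not stated in this passage, and it propagates the uncertainty on $\gamma$ linearly.

```latex
\begin{align}
  k_{\mathrm{cf}}^{4-\gamma}\,\mathrm{d}k_{\mathrm{cf}} &\propto \mathrm{d}t_s
    && \text{(time-independent } \epsilon\text{)} \\
  t_d &\propto \int_{k_F}^{k_D} k^{4-\gamma}\,\mathrm{d}k
       = \frac{k_D^{5-\gamma} - k_F^{5-\gamma}}{5-\gamma}
    && \text{(front travels from } k_F \text{ to } k_D\text{)} \\
  t_d &\propto k_D^{5-\gamma}
    && (\gamma < 5,\; k_D \gg k_F) \\
  t_d &\propto U_D^{(5-\gamma)/2}
    && \text{(assuming } U_D \propto k_D^2\text{)} \\
  \beta &= \frac{5-\gamma}{2}, \qquad
  \delta\beta = \frac{\delta\gamma}{2} = \frac{0.1}{2} = 0.05
    && (\gamma = 3.5 \pm 0.1 \;\Rightarrow\; \beta = 0.75 \pm 0.05)
\end{align}
```

For $\gamma > 5$ the integral would instead be dominated by its lower limit $k_F$, and $t_d$ would lose its power-law dependence on $k_D$.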

In Fig. 4B, we show the variation of $t_d$ with
$U_D$. We find that our data are indeed well
described by a power law, with $\beta = 0.73 \pm 0.06$,
in agreement with our prediction. The results
of our numerical simulations (solid circles)
show similar behavior with a small systematic
offset; a fit to the numerical data gives
$\beta = 0.71 \pm 0.01$.
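
The comparison between predicted and fitted exponents can be checked with a few lines of arithmetic. This is a minimal sketch: the function names are hypothetical, and the agreement criterion (difference within one combined standard deviation, with errors added in quadrature) is an assumption chosen for illustration, not a test described in the text.

```python
import math


def predicted_beta(gamma, gamma_err):
    """Return beta = (5 - gamma) / 2 and its linearly propagated uncertainty."""
    beta = (5.0 - gamma) / 2.0
    beta_err = gamma_err / 2.0  # |d(beta)/d(gamma)| = 1/2
    return beta, beta_err


def agrees(a, a_err, b, b_err, n_sigma=1.0):
    """Assumed criterion: |a - b| within n_sigma combined standard deviations."""
    return abs(a - b) <= n_sigma * math.hypot(a_err, b_err)


# Values quoted in the text: gamma = 3.5 +/- 0.1
beta, beta_err = predicted_beta(3.5, 0.1)

# Fitted exponents quoted in the text
experiment_ok = agrees(beta, beta_err, 0.73, 0.06)  # experimental fit
simulation_ok = agrees(beta, beta_err, 0.71, 0.01)  # numerical-simulation fit

print(f"predicted beta = {beta:.2f} +/- {beta_err:.2f}")
print("experiment consistent:", experiment_ok)
print("simulation consistent:", simulation_ok)
```

Under this criterion both fitted values are consistent with the prediction; a stricter criterion would give a different verdict for the simulation value, whose quoted uncertainty is much smaller.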

Finally, we note that the criterion for $t_d$ to
show scaling behavior, namely $\gamma < 5$ and hence
$\beta > 0$, is intimately linked to another important
concept in the theory of turbulence. For
$\gamma < 5$, the steady-state spectrum has infinite
energy capacity, meaning that it carries infinite
energy for $k_D \to \infty$. It is indeed generally
expected for infinite-capacity systems
that the cascade front propagates at a finite
speed and that the Kolmogorov-Zakharov
turbulence spectrum forms behind it (36).
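
The link between $\gamma < 5$ and infinite energy capacity follows from integrating the energy carried by the power-law spectrum. A short sketch, assuming an isotropic three-dimensional distribution $n(k) \propto k^{-\gamma}$ and a free-particle energy per atom $\propto k^2$ (both consistent with the $k_{\mathrm{cf}}^{4-\gamma}$ scaling used above, but stated here as assumptions):

```latex
\begin{equation}
  E(k_D) \;\propto\; \int_{k_F}^{k_D} k^{2}\, n(k)\, 4\pi k^{2}\,\mathrm{d}k
  \;\propto\; \int_{k_F}^{k_D} k^{4-\gamma}\,\mathrm{d}k
  \;=\; \frac{k_D^{5-\gamma} - k_F^{5-\gamma}}{5-\gamma}
  \;\xrightarrow[\;k_D \to \infty\;]{}\;
  \begin{cases}
    \infty, & \gamma < 5 \quad \text{(infinite capacity)} \\[4pt]
    \dfrac{k_F^{5-\gamma}}{\gamma-5}, & \gamma > 5 \quad \text{(finite capacity)}
  \end{cases}
\end{equation}
```

The same integral sets the time needed for the front to reach $k_D$ at constant $\epsilon$, which is why the condition for $t_d$ to scale with $U_D$ coincides with the infinite-capacity condition.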


Our work provides a complete, consistent
picture of the wave turbulence dynamics at
both short (pre-steady-state) and long (steady- 
state) times. In the broader context of far-from- 
equilibrium many-body quantum systems, a 
turbulent quantum gas with a large kp pro- 
vides a particularly interesting example of an 
essentially stationary nonthermal state [see 
also (37-39)]. The possibility of synthetic dissi- 
pation also opens new theoretical perspectives. 
In the future it would be interesting to engineer 
arbitrary momentum-cutoff landscapes, which 
could, for example, allow studies of anisotropic 
turbulence. By dynamically tuning the dissipa- 
tion scale or the driving force, it should also be 
possible to study quenches between different 
turbulent states. 
new products: protein analysis


Protein Analysis System 
Fluidic Analytics’ Fluidity One-W assesses 
on-target protein interactions in solution, 
even in crude biological backgrounds 
such as cell lysates or blood plasma. This 
capability can produce detailed analyses 
of proteins in near-native states and 
in natural environments, facilitating
earlier, more accurate analysis of targets 
for faster decision-making. Fluidic 
Analytics has made this possible with its proprietary diffusional 
sizing technology, which is based on the well-understood 
relationship between size and diffusion rate, and yields absolute 
size measurements. The measurements provided by the Fluidity 
One-W confirm the identity of complexes, allowing scientists to 
control for off-target binding and false positive measurements, 
and delivering zero ambiguity for the utmost confidence in 
results. The instrument's unique abilities and high sensitivity 
enable researchers to study proteins that are traditionally 
problematic for other systems, including membrane proteins, 
multiprotein complexes, and intrinsically disordered proteins. 
Fluidic Analytics 
For info: +44-(0)-01223-560-432 
www.fluidic.com 




E. coli Host Cell Protein Kit 

Gyros Protein Technologies offers a host cell protein (HCP) kit 
for automated impurity analysis of biotherapeutics expressed in 
Escherichia coli systems. The kit is optimized for use in Gyrolab 
systems and has been developed as part of a licensing and 
supply agreement with Cygnus Technologies (part of Maravai 
LifeSciences), incorporating their industry standard E. coli HCP 
antibodies and other reagents. The Gyrolab E. coli HCP Kit
quantifies HCP impurities from E. coli expression systems used in 
the production of biotherapeutics, a critical step in ensuring the 
efficacy and safety of the drug molecule. 

Gyros Protein Technologies 

For info: 800-477-6834 

www.gyrosproteintechnologies.com 


PD-L1 Antibody Panel 

In order to give you the best chance of finding the optimal
PD-L1 clone for your research assay, you can now get access to
four key recombinant monoclonal PD-L1 antibody clones from 
Abcam (73-10, 28-8, SP142, and CAL10), as well as the anti-mouse 
recombinant rabbit monoclonal EPR20529 and the mouse 
monoclonal PDL1/2746. PD-L1 (also known as B7-H1, CD274) is a
ligand that binds to PD-1 (programmed cell death-1), expressed on 
activated T cells, to evade antitumor responses. PD-L1 plays a role 
in inhibiting T-cell activation and proliferation and has emerged as 
an important target in cancer treatment. PD-L1 protein detection 
by immunohistochemistry (IHC) testing is widely used as a 
predictive biomarker assay for anti-PD-1/PD-L1 therapies. You can 
now access all four anti-human recombinant rabbit monoclonal
antibody clones in our trial-size panel kit. 

Abcam 

For info: 888-772-2226 

www.abcam.com 


Protein Standards 

Atlas Antibodies introduces prequantified QPrEST Protein 
Standards for absolute quantification of proteins in biological 
samples such as cell lysate and plasma, using LC/MS. QPrESTs 
consist of 50-150 amino acids identical to a part of the 
corresponding human protein. They are produced recombinantly 
in the presence of stable, isotope-labeled (i.e., heavy) amino acids. 
One advantage of QPrESTs over other available protein standards 
is that they are added to the sample early in the workflow, prior 
to proteolytic digestion. The shared amino acid sequence of
the QPrEST and the endogenous protein enables the QPrEST to
correct for variation introduced during sample digestion, thereby 
increasing quantification accuracy. 

Atlas Antibodies 

For info: +46-(0)-8-54-59-58-50 

www.atlasantibodies.com 


Custom Protein Service 

OriGene offers comprehensive, high-quality recombinant custom 
protein services from different expression systems, in addition to 
the standard catalog items. Our proprietary protein expression/ 
purification technology maximizes the purity and yield of target 
proteins in different systems. We offer a complete “start-to-finish” 
service from the complementary DNA (gene) cloning step to the 
protein purification step, and downstream modifications such as 
tag addition/removal, endotoxin testing/removal, conjugations, 
and any other services you may request. Choose OriGene as 
your reliable partner for protein-related research, and we can 
help you accelerate your discovery in a timely, cost-effective, and 
affordable manner. 

OriGene 

For info: 888-267-4436 
www.origene.com/products/proteins/custom-service 


HCP ELISA Detection Kits 

AMS Biotechnology has launched a range of ELISA Host Cell 
Protein (HCP) detection kits that allow you to discover host cell 
protein impurities in your therapeutic at any phase of product 
development. The kits are based on a sandwich ELISA test and 
come in a convenient, 96-well plate format. Specific HCP detection 
kits are available for popular expression systems, including CHO, 
Escherichia coli, Pichia, HEK293, and Protein A. Each kit allows 
quantification of up to 24 samples. Our HCP detection kits offer 
5-10 times more sensitivity, greater antibody coverage, and lower 
lot-to-lot variations as compared to alternative kits, resulting in 
a more consistent, reliable antibody supply and a reduction in
biomanufacturing costs. 

AMS Biotechnology 

For info: 800-987-0985 

www.amsbio.com 
FIU Arts, Sciences & Education
Walter and Rosalie Goldberg Professorship in Tropical Ecology and Conservation


The Department of Biological Sciences at Florida International University
(FIU, http://biology.fiu.edu) seeks a top scholar to fill the Walter and Rosalie
Goldberg Professorship in Tropical Ecology and Conservation. We seek qualified
individuals whose area of expertise is sustainability, resilience, impact of invasive 
species, artificial materials, or other aspects of human disruption of terrestrial, 
aquatic, or marine tropical ecosystems. The ideal candidate will exhibit an 
interdisciplinary approach to research that will complement existing expertise 
either within the Department of Biological Sciences or between Biology and 
allied science departments. The Department of Biological Sciences
(http://biology.fiu.edu/), in the School of the Environment, Arts, and Society
(seas.fiu.edu) within the College of Arts, Sciences & Education, has ~5,000 majors
and 120 graduate students in fields ranging from cell and molecular biology 
to evolution and ecology. Tropical Research is an area of preeminence at FIU 
with several institutes and centers relevant to tropical ecology including the 
International Center for Tropical Botany (ictb.fiu.edu), the Institute of Water 
and Environment (inwe.fiu.edu), the Tropical Conservation Institute (tci-fiu. 
edu), the Southeast Environmental Research Center (serc.fiu.edu), The Sea 
Level Solutions Center (slsc.fiu.edu), The Center for Coastal Oceans Research 
(ccor.fiu.edu), and the International Hurricane Research Center (www.ihrc.fiu.edu).
The successful candidate will be expected to have an exceptional
publication record, bring and maintain an active externally funded research 
program, participate in the Biology graduate program, as well as teach a course 
in their area of expertise. The minimum requirements are a doctoral degree from 
an accredited institution, associate or full professor or equivalent status, and 
demonstrated record of achievement in academic research, teaching and service. 


Qualified candidates are encouraged to apply to Job Opening ID 519325 at 
facultycareers.fiu.edu and attach a cover letter, curriculum vitae, and 
statements of research, teaching, and service as a single pdf file. Please also 
provide names and contact information for at least three references who will 
be contacted upon submission of application. To receive full consideration, 
applications and required materials should be received by November 18, 
2019. Review will continue until position is filled. Please direct inquiries 
about this search to the chairman of the search committee, Dr. Joel Trexler
(trexlerj@fiu.edu).
Marine Biological Laboratory
2020 Whitman Center Fellowships 


THE MARINE BIOLOGICAL LABORATORY, a hub for research 
and education and an affiliate of the University of Chicago, 
convenes biologists from around the world each year to advance 
the mission of biological discovery. We are now accepting 
applications for Whitman Center Fellowships for 2020. Support 
is available for scientists to come to the Marine Biological 
Laboratory for 4 to 10 weeks to conduct research, year-round. 


We particularly encourage applications from individuals or 
collaborative groups focused on the following: 


+ Evolutionary, genetic, and genomic approaches in 
regenerative and developmental biology, microbiomes, and 
neuroscience with an emphasis on marine organisms 


+ Integrated imaging and computational approaches to 
illuminate cellular function and biology emerging from the 
study of marine and other organisms 


+ Integrated approaches to the study of microbial 
communities and marine organisms in coastal communities. 


The 2020 Whitman Center Fellowships include Whitman Center 
Early Career Fellowships, specifically designated for individuals 
less than 10 years from their doctoral degree who wish to focus 
on these areas of biological discovery, and the Whitman Center 
E.E. Just Fellowships for underrepresented minorities in science 
or individuals from HBCUs or MSIs.


Whitman Center Fellowships cover laboratory rental and 
housing costs. The MBL offers access to state-of-the-art 
instrumentation, innovative imaging technology, DNA 
sequencing and informatics, year-round availability of model 
freshwater and marine organisms, and modern laboratory 
facilities. 


The Marine Biological Laboratory hosts more than 1,000 
researchers, postdocs, and graduate students from around the 
world to participate in scientific discovery courses, research, 
lectures, and field studies. As a convener of biology, the Marine 
Biological Laboratory is well known for fostering a highly 
collaborative environment, with scientists and students engaged 
in intensive research in a collegial and informal atmosphere. 


Eligible applicants must hold appointments at accredited 
universities, colleges or research institutions anywhere in the 
world. While applications will be evaluated on the basis of 
scientific merit, we are especially interested in individuals from 
diverse backgrounds, experiences and perspectives, especially 
those underrepresented in science.


mbl.edu/research/whitman-fellowships 
research@mbl.edu 
Application Deadline: December 15 
THE UNIVERSITY OF


TOLEDO 


Tenure-Track Assistant Professor Faculty Position in 
Cellular or Molecular Immunology 


The Department of Biological Sciences in the College of Natural Sciences and Mathematics at the 
University of Toledo seeks to fill a tenure-track assistant professor faculty position. Appointment at a 
higher rank will also be considered. Applicants are required to hold a Ph.D. with post-doctoral experience 
in Biological Sciences or a related field. Candidates in cellular or molecular aspects of immunology will be 
given priority. However, outstanding candidates in other areas of cellular or molecular biology will also be 
considered. It is important that candidates are able to collaborate with other faculty within the Department. 
Innate and other host disease defense mechanisms are part of our Departmental core research strengths, 
in addition to Cell Architecture and Dynamics, cell division/cancer biology, neuroscience, and regulation 
of gene expression, unified by a common interest in cell signaling. Other Departments in the College of 
Medicine on the near-by Health Science Campus of the University complement these areas. Facilities 
include a modern research complex with sophisticated laboratories and access to outstanding research 
instrumentation, including flow cytometers/cell sorters, confocal and super-resolution microscopes, and
a new transmission electron microscope. The successful candidate is expected to advance excellence in 
the Department of Biological Sciences at the University of Toledo by establishing an externally-funded 
research program and contributing to undergraduate and graduate education. The Department offers 
B.A., B.S., M.S., and Ph.D. degrees. 


Additional information is available on the Departmental website: www.utoledo.edu/nsm/bio/index.html. 
Salary and start-up funds are competitive. Review of applications will begin immediately and continue 
until the position is filled. The starting date for this position will be August 2020. Interested candidates 
must apply at https://jobs.utoledo.edu with a letter of application, curriculum vitae, statements of research 
and teaching interests, and the names and contact information for three referees. Email inquiries may be 
directed to Lirim Shemshedini at LIRIM.SHEMSHEDINI@utoledo.edu or Scott.Leisner@utoledo.edu. 


The University of Toledo is an Affirmative Action Equal Opportunity Employer. All qualified 
applicants will receive consideration of employment without regard to race, color, religion, sex, 
national origin, disability, or protected Vets status. Qualified women and minorities are especially 
encouraged to apply. In addition, we are particularly interested in receiving applications from candidates whose
research focuses on underrepresented populations and/or who have experience mentoring
underrepresented minority students. The University of Toledo is committed to achieving inclusive 
excellence through a diverse workforce. 
THE STATE UNIVERSITY
OF NEW JERSEY 


Rutgers-The State University of New Jersey 
seeks applicants for appointment at the rank 
of Assistant Professor/Associate Professor/ 
Professor, Basic Science in the Department 
of Pathology, Immunology and Laboratory 
Medicine at the Rutgers-New Jersey Medical 
School located in Newark, New Jersey. This 
is a full-time Assistant/Associate or full
Professor position. This position will support
the research mission within the department, as 
well as the School. The research areas include 
Infectious Diseases, Cancer and Immunology. 
The successful candidate is expected to develop 
a strong independent research program within the 
department. In addition, he/she will participate 
in education and activities supporting research 
and scholarship. The successful candidate 
will bring in extramural funding with research 
interests and strength in Infectious Diseases, 
Immunology, and Cancer to complement and 
further the Department’s research mission. The 
successful candidate will initiate translational 
research in collaboration with the clinical 
faculty and participate in teaching of the 
RBHS-Graduate School of Biomedical Sciences 
(GSBS). Minimum education and experience: 
MD, PhD, or MD/PhD. Extramural funding is 
expected in the research area(s) of Infectious
Diseases, Immunology and Cancer. Teaching
experience. Applications or inquiry should be 
submitted to the following email address: chen. 
liu@rutgers.edu. 


Rutgers University is an AA/EEO Employer: 


SOMETIMES 

THE GRASS REALLY 
IS GREENER 
SOMEPLACE NEW. 


Find your next job at ScienceCareers.org


There’s scientific proof that when you’re happy
with what you do, you’re better at what you do. Access
career opportunities, see who’s hiring and take 
advantage of our proprietary career-search tools. 
Get tailored job alerts, post your resume and manage 
your applications all in one place: sciencecareers.org 


ScienceCareers


FROM THE JOURNAL SCIENCE 


U.S. POSTAL SERVICE 


Statement required by the Act of 12 August 1970, 
Section 3685, Title 39, United States Code, showing 
the ownership, management, and circulation of: 

1-9. Science, Publication No. 0036-8075, is published 
weekly on Friday, except the last week in December, at 
1200 New York Avenue, N.W., Washington, DC 
20005. Date of filing: 02 October 2019. This is also the 
address of the publisher, the editor, and the managing 
editor, who are, respectively, Bill Moran, Holden 
Thorp, and Monica M. Bradford. 

10. The owner is the American Association for the 
Advancement of Science, 1200 New York Avenue, 
N.W., Washington, DC 20005. Stockholders: None. 

11. Known bondholders, mortgagees, and other security
holders owning or holding 1 percent or more of total 
amount of bonds, mortgages, or other securities: None. 
12. The purpose, function, and nonprofit status of this 
organization and the exempt status for federal income 
tax purposes have not changed during the preceding 12 
months. 

13-15. The average number of copies of each issue 
during the preceding 12 months is (A) Total number of 
copies printed: 62,943; (B) Paid circulation: 58,402; (1) 
Paid/Requested outside-county mail subscriptions 
stated on form 3541: 51,244; (2) Paid/Requested in- 
county subscriptions stated on form 3541: 0; (3) Sales 
through dealers and carriers, street vendors, counter 
sales: 7,151; (4) Other classes mailed through USPS: 7;
(C) Total paid circulation: 58,402; (D) Free distribu- 
tion: samples, complimentary, and other free copies: 
3,352; (1) Outside-county as stated on form 3541: 
2,713; (2) In-county as stated on form 3541: 0; (3)
Other classes mailed through the USPS: 2; (4) Free
distribution outside of mail: Carrier or other means: 637;
(E) Total free distribution: 3,352; (F) Total distribution: 
61,754; (G) Copies not distributed: 1,190; (H) Total: 
62,944; (I) Percent paid and/or Requested Circulation: 
94.6%. 

Actual number of copies of single issue (9/20/2019) 
published nearest to filing date are (A) Total number of 
copies printed: 60,867; (B) Paid circulation: 56,686; (1) 
Paid/Requested outside-county mail subscriptions 
stated on form 3541: 49,906; (2) Paid/Requested in- 
county subscriptions stated on form 3541: 0; (3) Sales 
through dealers and carriers, street vendors, counter 
sales: 6,774; (4) Other classes mailed through USPS: 6; 
(C) Total paid circulation: 56,686; (D) Free distribu- 
tion: Samples, complimentary, and other free copies: 
3,181; (1) Outside-county as stated on form 3541: 
2,749; (2) In-county as stated on form 3541: 0; (3) 
Other classes mailed through the USPS: 2; (4) Free dis- 
tribution outside of mail: Carrier or other means: 430; 
(E) Total free distribution: 3,181; (F) Total distribution: 
59,867; (G) Copies not distributed: 1,000; (H) Total: 
60,867; (I) Percent paid and/or Requested Circulation: 
94.7%. 

I certify that the statements made above are correct and 
complete. (signed) Bill Moran, Publisher. 


ScienceCareers 


FROM THE JOURNAL SCIENCE


Follow us for jobs, 
career advice & more! 


@ScienceCareers 


/ScienceCareers


Science Careers 


ScienceCareers.org 


Focusing Expertise - Shaping the Future: The Jülich Aachen Research Alliance (JARA)
is an innovative cooperation model between RWTH Aachen University and Forschungszentrum
Jülich. This Alliance brings together an internationally respected university of
science and technology and one of the leading national research centres in Europe.


As a member of the Helmholtz Association, Forschungszentrum Jülich makes an effective
contribution to solve major challenges facing society in the fields of information, energy, and 
bioeconomy. They focus on varied tasks in the area of research management and utilize large, 
often unique, scientific infrastructures. Come and work with around 6,100 colleagues across 
a range of topics and disciplines at one of Europe’s largest research centres. 


As one of Germany’s Universities of Excellence, RWTH enjoys a global reputation for 
high-quality teaching and research. The university seeks to drive the convergence of 
knowledge, methods and findings from its different research fields and to integrate this 
in-depth knowledge into its interdisciplinary profile areas. RWTH’s dynamic and creative 
environment is characterized by efficient networks, institutionalized collaboration and 
the strong innovative capacity of RWTH Aachen Campus, one of the largest technology- 
oriented research landscapes in Europe. 


The research performed at the Peter Grünberg Institute (PGI) ranges from physical
concepts and emerging materials to novel nanoelectronic devices. It is an active partner in
the Cluster of Excellence “Matter and Light for Quantum Computing” (ML4Q; cooperation
by the universities of Aachen, Cologne, and Bonn, as well as the Research Center Jülich)
and the European “Quantum Flagship”. Supported by the Jülich Supercomputing Centre
(JSC) and the Helmholtz Nanoelectronic Facility (HNF), a 1000 m² clean room, PGI will
further strengthen its activities on quantum information. It anticipates the next level by
harbouring the Helmholtz Quantum Center (HQC), home of the newly founded institute on
quantum computing. In this context RWTH Aachen University and Forschungszentrum
Jülich are jointly seeking a


Director for the PGI - Institute for Quantum
Computing (PGI-13) appointed as Professor
for Quantum Computing (W3, Jülicher
Modell) at RWTH Aachen University,
Department of Physics


In close cooperation with other national and international groups we are initiating a center 
of expertise on fundamentals of quantum computing. Within this effort the goal of the new 
institute is the development of experimental quantum processors. The focus should be on 
multi-qubit circuits using scalable solid state qubits. Areas of interest include the coupling 
of qubits, high fidelity and scalable qubit control, as well as system integration. The 
institute should seek strong synergies with existing initiatives within JARA, the planned 
Helmholtz Quantum Center and the Cluster of Excellence ML4Q. 


Teaching requirements (2 hours/week) could, for example, be fulfilled with courses in 
quantum science and technology within the physics programs at RWTH Aachen. A doctoral 
degree and an exemplary record of research achievements at the group leader level are 
required. 


RWTH Aachen University and Forschungszentrum Jülich are equal opportunities employers
and pursue a policy of excluding all types of discrimination. Applications from women 
scientists are particularly welcome. Applications from women will be given preference in 
the case of equal suitability, qualifications and experience, unless special reasons 
concerning the person of a male candidate outweigh these considerations. Both 
institutions also are family- and disability-friendly and offer support for dual career couples. 


Applications should be in English and should be sent with the usual documentation (CV, 
copies of certificates, list of publications, teaching experience, brief summary of previous 
research activities including a list of third-party funding, research concept for the advertised 
position), by 15-12-2019 to (e-mail preferred):


Board of Directors of
Forschungszentrum Jülich GmbH
D-52425 Jülich
berufungen@fz-juelich.de

and

Dean of the Faculty of Mathematics,
Computer Science and Natural Sciences at
RWTH Aachen University, Prof. U. Simon
D-52056 Aachen
application@fbi.rwth-aachen.de


Further information can be found at: www.jara.org/jara-fit/PGI-13 
Escaping ‘The Waiting Place’


In Dr. Seuss’s classic book Oh, the Places You'll Go! he warns us that sometimes in life we may
“grind on for miles across weirdish wild space, headed, I fear, toward a most useless place. The 
Waiting Place.” Little did I know it, but after I emerged from the “weirdish wild space” of gradu- 
ate school, The Waiting Place was exactly where I was headed. I had accepted a postdoc position 
to develop new skills while embarking on what I anticipated would be a quick, productive search 
for an industry job. Instead that search dragged on, filled with false hopes and disappointments. 
I often felt I was going nowhere. But now that I’ve passed the 1-year mark in my first position outside 
of academia, I can look back and say that I learned a valuable skill: the ability to wait actively. Here
are three strategies I found most helpful.


SEEK GUIDANCE. As a freshly
minted Ph.D., I had plenty of 
material for my resume, but I 
wasn’t sure how to pitch my 
skills to potential employers. To 
address this, I sought out a ca- 
reer coach who specialized in 
helping people with Ph.D.s in 
science and engineering. Hiring 
a professional consultant wasn’t 
cheap, but I viewed it as an 
investment in another valuable 
skill: marketing myself. His guid- 
ance not only improved my re- 
sumes, but also gave me a better 
general understanding of how to 
present myself in meetings with 
recruiters and in interviews. I 
also took advantage of many free 
opportunities for career advice, 
including university-sponsored 
career forums, skills assessments 
with an on-campus career coun- 
selor, and numerous online resources for job seekers. 


MEET NEW PEOPLE. At first, I felt that I didn’t know “the right 
people” to help me get a job, especially because I was look- 
ing for an industry position and my contacts were mostly in 
the academic world. However, once I started to look, I was 
surprised to realize how many second-degree connections— 
friends of friends—I actually had in industry. I reached out 
to many of them and asked to talk about their work. I also 
asked friends and colleagues for introductions to people in 
their networks. And I found ways to meet people outside 
my niche research area, for example by attending presenta- 
tions by startup companies, meetings of technology interest 
groups, and on-campus socials. 

It sometimes felt awkward to put myself out there, but 
networking helped me learn about many companies and
industries. It also led to job inter- 
views, including with a Silicon Val- 
ley tech giant, a premier children’s 
hospital, and a highly ranked lib- 
eral arts college. Organizations 
like to hire people they can trust, 
so having someone on the inside 
vouch for you can help you rise to 
the top of the applicant stack and 
get an interview. 


PLAY THE LONG GAME. In the end, 
I got a job I didn’t even apply for. 
During my first year as a post- 
doc, I applied for an opportunity 
I found on an online job board. 
This led to a promising email ex- 
change, and I had a very positive 
interaction with one of the com- 
pany’s researchers at a conference. 
Unfortunately, the opportunity 
fell through. Another long year 
passed. I followed up with the 
company once or twice, but mostly I pursued other leads. 
Then, out of the blue, the researcher I had met contacted 
me about a new opening. It turned out I was a perfect 
fit for the company’s current projects, and after a couple 
short phone calls I had an offer. It took almost 2 years to 
come to fruition, but the effort I had invested in build- 
ing and maintaining a relationship with the company 
ultimately paid off. 

The Waiting Place was frustrating, but I learned it 
doesn’t have to be useless. Perseverance pays off. As Dr. 
Seuss reminds us: “And will you succeed? Yes! You will, 
indeed! (98 and 3/4 percent guaranteed.)” 


Nathan Pickle is a research scientist at CFD Research 
Corporation in Huntsville, Alabama. Do you have an interesting 
career story to share? Send it to SciCareerEditor@aaas.org. 